Abstract We investigate a special part of the human immune system, namely the activation of T-Cells, using stochastic tools, especially sharp large deviation results. T-Cells have to distinguish reliably between foreign and self peptides which are both presented to them by antigen presenting cells. Our work is based on a model studied by Zint, Baake, and den Hollander in [23] and originally proposed by van den Berg, Rand, and Burroughs in [20]. We are able to dispense with some restrictive distribution assumptions that were used previously, i.e. we establish a higher robustness of the model. A central issue is the detailed analysis of two new perspectives on the scenario (two different quenched systems). This means that we do not only analyse the total probability of a T-Cell activation (the annealed case) but also consider the probability of an activation of one certain T-Cell type and the probability of a T-Cell activation by a certain antigen presenting cell (the quenched cases). Finally, we see analytically that the probability of T-Cell activation increases with the number of presented foreign peptides in all three cases.

Keywords Immune system; T-Cell activation; large deviations; cross-reactivity
1 Introduction and model setting

In the present paper we analyze a stochastic model for T-Cell activation that was introduced by van den Berg et al. [20], [19] and later developed and studied by Zint et al. [23]. While we allow for a slightly more general model setting, the main new contribution is the analysis of some different experimental settings that correspond mathematically to various conditional probabilities. This is explained in detail below.
1.1 Biological perspective

Let us first of all say that the model we consider concerns only the mechanism of T-Cell activation. There are many other processes involved in the immune response that are not considered here at all. T-Cells have the task to recognize foreign antigens against a noisy background of the body's own antigens. Any substance which is able to elicit an immune response is called antigen (from the term antibody generating). T-Cells do not react with free antigens present in the body, but only with antigens presented by antigen presenting cells (APCs). APCs collect material in the body, internalize it and split it up into peptides which are afterwards presented on the surface of the APC.

During a so-called immunological synapse, a bond between a T-Cell and an APC, the T-Cell scans the presented mixture of peptides using its receptors. There is only one receptor type on each T-Cell, but many different peptide types on each APC. The task of a T-Cell is to decide on the basis of signals received during an immunological synapse whether foreign antigens are present in the body, and to trigger an immune reaction when indicated (in reality, this involves a complex interplay with other parts of the immune system which we do not deal with here). This task is made difficult by the following fact: According to Mason [12] and Arstila et al. [1], there exist about $10^{13}$ peptide types that should be recognized but only $10^{7}$ different receptor types. This implies that a fully specific recognition, i.e. that each receptor type recognizes exactly one specific antigen, is impossible. Therefore, a certain degree of cross-reactivity has to be assumed. The presence of auto-immune diseases, allergies and severe diseases caused e.g. by viruses and bacteria shows that the immune system faces a very difficult task.



1.2 The model

In [19] and [23] a mathematical model was presented that allows one to interpret the functioning of T-Cells as a statistical test problem. We briefly describe this model following [23].



Characterization of the APC. An APC is characterized by the types of the presented peptides and the number of copies of each peptide type. Thus, each APC can be represented by a set of parameters $z_j$ representing the numbers of peptides of type $j$. Peptide types are sometimes distinguished as constitutive (i.e. being present in all cells) and variable (i.e. being present only in some cells), but this distinction plays no major role in the present paper (see, however, [11]). The index $j$ ranges over all peptide types present on this APC. We denote the number of foreign peptides present on the APC by $z_f$ and allow, for simplicity, for only one type of foreign peptides on one APC. Note that the peptides on the APC are collected by a given APC and thus are a random sample from the total pool of peptides present in the body.

Characterization of the T-Cell. Each T-Cell possesses one receptor type on its surface. The T-Cell is characterized by the interaction of its receptors with the different peptide types. To each receptor type $i$ corresponds a set of association rates, $a_{ij}$, and dissociation rates, $r_{ij}$. Here $i$ ranges over all receptor types and $j$ over all peptide types. We use the following assumption in line with the approach in [23] and [20].

Assumption 1 There is an abundance of receptors on each T-Cell in the region of interaction with the APC 
such that each released peptide is immediately bound by a receptor again. Thus, association can be assumed 
as instantaneous and the association rates play no role. 

Remark 1 We see later that we have to work with random stimulation rates to investigate the event of T-Cell activation. It will become clear that the qualitative behaviour does not rely on their exact distribution, and thus Assumption 1 is not too restrictive.

Under Assumption 1 a T-Cell is fully characterized by the set $r_{ij}$, where $i$ refers to the particular T-Cell. We will see later that we are interested in the duration of peptide-receptor-complexes. These durations for a complex of type $ij$ are denoted by $t_{ij}$ and depend on the dissociation rates.

Activation criteria. The interaction of the cells produces a stimulating signal which the T-Cell receives. This signal results in an activation of the T-Cell if certain criteria are met. The papers [14], [17], [16] and [22] suggest the following assumption.

Assumption 2

1. A T-Cell receives a stimulus if a peptide-receptor-complex exists longer than a time $t_*$.

2. A T-Cell sums up all the stimuli it gets, even those of different receptors.

3. A T-Cell is activated if the sum of all stimuli exceeds a threshold value $g_{act}$.

Note that activation is induced by stimulation and thus the stimulation rates $w_{ij}$ which result from a peptide-receptor-complex of type $ij$ are important. The central quantity to analyse is then the total stimulation rate a T-Cell receives. It is denoted by $g_i$ and combines all the parameters mentioned before.

The preliminary description demonstrates that we are concerned with a genuinely high-dimensional problem and model. The number of parameters for the dissociation rates alone is at least of the order $10^{20}$ ($10^{7}$ receptor types times $10^{13}$ peptide types).
Stochastic model. The large number of parameters in this model makes a specification of all of them impossible. Therefore, van den Berg et al. [20] proposed a stochastic model.

A closer look at the system reveals that we must deal with different types and sources of randomness. This is crucial for the interpretation of the probabilities in specific experimental setups. The dissociation rates are assumed to be random because they are unknown in detail. The presentation profile, i.e. which number of which peptide type is present on an APC, is random because it represents a random sample of the peptides present in the body. In fact, these quantities are doubly stochastic because we have a random pool of peptides in the body, from which a random sample is presented on the APC. For the sake of simplicity we do not model this double effect here. A further reason to assume randomness is that we consider a random encounter of two randomly meeting cells. Let us now specify the stochastic version of the model precisely.

Let $(\Omega, \mathcal{F}, P)$ be a probability space on which we define the following random variables. First, we characterize the presentation pattern on the APC.

(i) We set $n = n_c + n_v + 1$, where $n_c$ and $n_v$ denote the number of constitutive and variable peptide types on each APC.

(ii) $N_c$ and $N_v$ denote the number of constitutive and variable peptide types in the body.

(iii) The sample of constitutive and variable peptides is represented by positive random variables $Z^c_j$ and $Z^v_j$, each of them representing the number of copies of a certain peptide type; they are independent and identically distributed (i.i.d.) in the class of constitutive and variable peptides, respectively.

(iv) $Z^c_j$ and $Z^v_j$ are bounded, and independent of each other.

(v) $\mathcal{Z}$ is the $\sigma$-algebra generated by $Z^c_j$ with $j \in \{1, \ldots, N_c\}$ and $Z^v_j$ with $j \in \{1, \ldots, N_v\}$.

(vi) We use a shorthand notation for the conditional distribution, $P^{\mathcal{Z}}(A) = P(A \mid \mathcal{Z})$, and denote the corresponding conditional expectation by $E^{\mathcal{Z}}[\cdot]$.

It is reasonable to assume that $Z^c_j$ and $Z^v_j$ are bounded because the space on each APC is bounded. Next, we come to the characterization of the T-Cell.

(i) The total number of receptor types is $N_1 \in \mathbb{N}$, that of the peptide types is $N_2$.
(ii) The index $i$ denotes the receptor type and $j$ denotes the peptide type.
(iii) The dissociation rates of a complex of type $ij$ are positive, i.i.d. random variables $R_{ij}$ with distribution $\mathcal{P}$ and expectation value $E[R_{ij}] = \bar{\tau}$, $\bar{\tau} \in \mathbb{R}_+$.
(iv) $\mathcal{R}$ is the $\sigma$-algebra generated by $R_{ij}$ with $i \in \{1, \ldots, N_1\}$ and $j \in \{1, \ldots, N_2\}$.
(v) The times a certain peptide-receptor-complex of type $ij$ exists are positive i.i.d. random variables $T_{ij}$ with conditional distribution $P^{\mathcal{R}}(T_{ij} \in A) = P(T_{ij} \in A \mid \mathcal{R})$ and corresponding conditional expectation $E^{\mathcal{R}}[T_{ij}] \equiv E[T_{ij} \mid \mathcal{R}] = 1/R_{ij}$.

In this setting the $T_{ij}$ are random variables in a random environment, in other words they are also doubly stochastic. They represent the individual duration of a concrete peptide-receptor-complex of type $ij$. Therefore, the expected duration of such a bond should be reciprocally proportional to the corresponding dissociation rate $R_{ij}$. According to (iv) $E[T_{ij}] = (\bar{\tau})^{-1}$. The joint distribution of all dissociation rates is given by the product measure $\mathcal{P}^{\otimes N_1 N_2}$. The presented sample is from the biological point of view independent of the dissociation rates and the duration of the peptide-receptor-bonds. Thus, $\mathcal{Z}$ and $\mathcal{R}$ are independent. The parameters of the previous part can be considered as realizations of these random variables.

The stimulation rates. According to Assumption 2 a single bond between a receptor $i$ and a presented peptide of type $j$ during a synapse results in a stimulation signal if the binding time, $T_{ij}$, exceeds a certain threshold $t_*$. It is assumed that the relevant signal for the T-Cell is the compound average number of stimuli during a synapse,

$$\frac{1}{t} \sum_{k=1}^{N_j(t)} \mathbb{1}_{\{T^k_{ij} > t_*\}}, \tag{1.1}$$

where $N_j(t)$ denotes the total number of bindings with a peptide of type $j$ and $T^k_{ij}$ are the respective binding times. Assuming that the number of bindings is very large, it is reasonable to assume that the relevant signal a T-Cell is receiving from a given peptide type is given by

$$W_{ij} \equiv \lim_{t \uparrow \infty} \frac{1}{t} \sum_{k=1}^{N_j(t)} \mathbb{1}_{\{T^k_{ij} > t_*\}} = \frac{P^{\mathcal{R}}(T_{ij} > t_*)}{E^{\mathcal{R}}[T_{ij}]} \quad \text{a.s.}, \tag{1.2}$$

where the last equality follows from elementary renewal theory. We will henceforth consider the random variables $W_{ij}$ as the fundamental characteristics of a peptide-receptor interaction.
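The renewal limit in (1.2) can be illustrated numerically. The following is a minimal sketch, not part of the model specification: it assumes, purely for illustration, that the binding times are exponentially distributed with a fixed dissociation rate, so that $P(T > t_*)/E[T] = r e^{-r t_*}$. The function names and the parameter values are hypothetical.

```python
import math
import random


def empirical_stimulation_rate(rate, t_star, horizon, rng):
    """Count bindings completed before `horizon` that lasted longer than t_star,
    divided by the elapsed time, as in (1.1).

    Assumption (illustrative only): binding times are Exp(rate) distributed and
    a released peptide is rebound immediately (Assumption 1).
    """
    clock = 0.0
    stimuli = 0
    while True:
        duration = rng.expovariate(rate)
        if clock + duration > horizon:
            break
        clock += duration
        if duration > t_star:
            stimuli += 1
    return stimuli / horizon


def limiting_stimulation_rate(rate, t_star):
    """P(T > t_star) / E[T] for T ~ Exp(rate), the right-hand side of (1.2)."""
    return math.exp(-rate * t_star) * rate


if __name__ == "__main__":
    rng = random.Random(1)
    print(empirical_stimulation_rate(1.0, 1.0, 20000.0, rng))
    print(limiting_stimulation_rate(1.0, 1.0))
```

For a long observation window the two printed values should be close; the agreement does not depend on the exponential choice, which is only used here to have a closed form to compare against.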
Notation. Let the variable $z_f$ denote the number of presented foreign peptides of one particular type. The expected total number of peptides present at one APC is given by $n_M = n_c E[Z^c_1] + n_v E[Z^v_1]$. The factor $q_n = (n_M - z_f)/n_M$ used in the following ensures a proportional displacement of the presented self peptides by the foreign peptides.¹

We drop the index $i$ in favour of a clear notation since it is fixed for one T-Cell. The discussion above motivates Zint et al. [23] to define the total stimulation rate as follows.

Definition 1 With the notation introduced above, the total stimulation rate a T-Cell receives is given by

$$G_n(z_f) = q_n \left( \sum_{j=1}^{n_c} Z^c_j W_j + \sum_{j=n_c+1}^{n_c+n_v} Z^v_j W_j \right) + z_f W_f. \tag{1.3}$$

Note that $W_f$ has the same distribution as the other random variables representing stimulation rates, and is independent of these random variables and $\mathcal{Z}$.

Interpretation of the probabilities. The central quantity for our investigation is the probability of T-Cell activation. If we consider just one certain encounter of an APC and a T-Cell the "probability" of T-Cell activation is either 0 or 1. But one single experiment does not give much information on the actual situation because it is possible that this T-Cell makes a mistake. So we, and in fact the immune system, take another view of the scenario. We are concerned with investigating the following three different cases:

1.) The annealed case: Here we consider the total probability that any T-Cell is activated by any APC, $P(G_n(z_f) > g_{act})$. This probability can be interpreted as the frequency that an activation occurs at all. If $m$ denotes the number of observed experiments, that is the number of meetings of APCs and T-Cells, then

$$\frac{1}{m} \#\{\text{activated T-Cells}\} \to P(G_n(z_f) > g_{act}). \tag{1.4}$$

2.) The case quenched with respect to (w.r.t.) the dissociation rates of the T-Cell: In this case we investigate the conditional probability that a certain T-Cell type is activated by any APC, $P^{\mathcal{R}}(G_n(z_f) > g_{act})$. Here, one type of T-Cell is examining several peptide samples and the results are averaged. The probability represents the frequency of activations of T-Cells of type $i$ during different experiments. If $m$ denotes the number of meetings of T-Cells of type $i$ and different APCs, then

$$\frac{1}{m} \#\{\text{activations of T-Cells of type } i\} \to P^{\mathcal{R}}(G_n(z_f) > g_{act}). \tag{1.5}$$

3.) The case quenched w.r.t. the environment presented by an APC: The conditional probability that a certain APC activates any type of T-Cell, $P^{\mathcal{Z}}(G_n(z_f) > g_{act})$, is analysed here. If $m$ denotes the number of meetings of a certain APC with different T-Cells, then

$$\frac{1}{m} \#\{\text{activations of T-Cells by the given APC}\} \to P^{\mathcal{Z}}(G_n(z_f) > g_{act}). \tag{1.6}$$

In [23] only Case 1.) is considered. In our view the Cases 2.) and 3.) are even more relevant. The probability in Case 2.) is to be interpreted as the frequency of activation of a given T-Cell in repeated synapses with different presenter cells. We shall see that these probabilities depend strongly on the sensitivity of the given T-Cell to the particular presented foreign peptide. The main question is which information on $z_f$ is deducible from the behaviour of the T-Cells. This information emerges only on the basis of repeated experiments and the resulting frequency of activations. This leads to the question whether there exists a threshold value $g_{act}$ such that the presence of invaders is distinguishable from the self-background.
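The three frequency interpretations can be mimicked in a small Monte Carlo experiment. The sketch below is only an illustration under assumptions that are not taken from the paper: copy numbers are uniform on {1, ..., 10}, stimulation rates are uniform on (0, 1), constitutive and variable peptides share one distribution, and the values of `n`, `z_f` and `g_act` are arbitrary. All function names are hypothetical.

```python
import random

MEAN_COPIES = 5.5  # mean of the assumed uniform copy-number law on {1, ..., 10}


def draw_copies(rng, n):
    return [rng.randint(1, 10) for _ in range(n)]


def draw_rates(rng, n):
    return [rng.random() for _ in range(n)]


def total_stimulation(z, w, z_f, w_f, n_M):
    """G_n(z_f) = q_n * sum_j Z_j W_j + z_f * W_f with q_n = (n_M - z_f) / n_M."""
    q_n = (n_M - z_f) / n_M
    return q_n * sum(zj * wj for zj, wj in zip(z, w)) + z_f * w_f


def activation_frequency(rng, m, n, z_f, g_act,
                         fixed_z=None, fixed_w=None, fixed_w_f=None):
    """Fraction of m simulated encounters with G_n(z_f) > g_act.

    Whatever is passed as `fixed_*` is frozen (quenched); the rest is redrawn
    in every encounter.
    """
    n_M = n * MEAN_COPIES
    hits = 0
    for _ in range(m):
        z = fixed_z if fixed_z is not None else draw_copies(rng, n)
        w = fixed_w if fixed_w is not None else draw_rates(rng, n)
        w_f = fixed_w_f if fixed_w_f is not None else rng.random()
        if total_stimulation(z, w, z_f, w_f, n_M) > g_act:
            hits += 1
    return hits / m


rng = random.Random(0)
n, z_f, g_act, m = 40, 60, 125.0, 2000

# Case 1.): everything is redrawn in each encounter.
annealed = activation_frequency(rng, m, n, z_f, g_act)

# Case 2.): one T-Cell type (its rates are frozen) meets many APCs.
w_fixed = draw_rates(rng, n)
quenched_r_high = activation_frequency(rng, m, n, z_f, g_act,
                                       fixed_w=w_fixed, fixed_w_f=0.95)
quenched_r_low = activation_frequency(rng, m, n, z_f, g_act,
                                      fixed_w=w_fixed, fixed_w_f=0.05)

# Case 3.): one APC (its copy numbers are frozen) meets many T-Cells.
quenched_z = activation_frequency(rng, m, n, z_f, g_act,
                                  fixed_z=draw_copies(rng, n))

print(annealed, quenched_r_high, quenched_r_low, quenched_z)
```

In this toy setting the frequency in Case 2.) depends strongly on the frozen value of the foreign stimulation rate, which is the qualitative point made in the text.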

Scaling and asymptotics. To obtain sensible analytic results one needs to consider certain numbers to be large. Clearly, the total number of peptide types, $n = n_c + n_v + 1$, will be assumed to be the main large parameter. The number of foreign peptides has to be seen in comparison to this number, i.e. we understand that $z_f$ depends on $n$, and one wants to know how large $z_f$ has to be (as a function of $n$) for a reliable detection. The numbers $n_c$ and $n_v$ can in principle also be of different magnitude, e.g. one may think that $n_v \sim n$ and $n_c \sim n^{\beta}$, $\beta < 1$. For simplicity, we will consider here only the case $n_c \sim n_v \sim n$.

¹ One may argue that scaling the random variables by a common factor is not the best choice to achieve a constant expectation. E.g., if the $Z_j$ were assumed to be binomial random variables, it would be more reasonable to change the parameter in an appropriate way. However, this appears to have only little effect on the results and we keep following Zint et al. at this point.
2 Results

2.1 Expectation value and variance of the total stimulation rate

We compute the expectation value and variance of the total stimulation rate as a function of the number of foreign peptides, $z_f$, in Cases 1.), 2.), and 3.). It is easy to calculate these quantities and they allow a first insight into why and in which range a foreign-self distinction may work. We assume the existence of the involved first and second moments.

Case 1.) According to the following lemma an increasing number of foreign peptides does not change the expectation value but the variance of the total stimulation rate.

Lemma 1 It holds that $E[G_n(0)] = E[G_n(z_f)]$, and that

$$V[G_n(z_f)] - V[G_n(0)] = \left( V[W_1] + \frac{V[G_n(0)]}{n_M^2} \right) z_f \left( z_f - \frac{2 n_M V[G_n(0)]}{V[W_1] n_M^2 + V[G_n(0)]} \right). \tag{2.1}$$

We can see that the expectation value of $G_n(z_f)$ is independent of $z_f$. The variance depends quadratically on $z_f$; the difference of the variances as a function of $z_f$ is a parabola with roots $2 n_M V[G_n(0)] / (V[W_1] n_M^2 + V[G_n(0)])$ and $0$. The function decreases first but for $z_f$ large enough it is monotonically increasing and positive. The qualitative behaviour shows up in this general setup; the exact shape depends only on the first and second moments of the involved random variables and not on their exact distribution. In the setting of an increasing variance and a constant expectation value the probability to exceed a threshold value larger than the expectation value increases. Thus, a T-Cell activation may become more likely for a larger value of $z_f$. The increasing variance may allow the threshold value $g_{act}$ to be set at a level such that permanent reactions are avoided but an activation becomes more probable. We see here already that a detection can only be possible if $z_f$ is large enough.
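The parabola in Lemma 1 can be checked against a direct computation from Definition 1, using only that the self part of $G_n(z_f)$ and $W_f$ are independent. The sketch below does this for hypothetical moments (uniform copy numbers on {1, ..., 10}, uniform stimulation rates on (0, 1), and 50 peptide types per block); these numbers are assumptions for illustration and the function names are not from the paper.

```python
def variance_difference_direct(z_f, n_M, var_g0, var_w):
    """V[G_n(z_f)] - V[G_n(0)] computed from G_n(z_f) = q_n * S + z_f * W_f,
    where S = G_n(0) and W_f are independent."""
    q_n = (n_M - z_f) / n_M
    return (q_n ** 2 - 1.0) * var_g0 + z_f ** 2 * var_w


def positive_root(n_M, var_g0, var_w):
    """Non-zero root of the parabola in (2.1)."""
    return 2.0 * n_M * var_g0 / (var_w * n_M ** 2 + var_g0)


def variance_difference_lemma(z_f, n_M, var_g0, var_w):
    """Right-hand side of (2.1)."""
    return (var_w + var_g0 / n_M ** 2) * z_f * (z_f - positive_root(n_M, var_g0, var_w))


# Hypothetical moments: Z uniform on {1..10}, W uniform on (0, 1).
n_c = n_v = 50
mean_z, second_z = 5.5, 38.5
mean_w, second_w = 0.5, 1.0 / 3.0
var_w = second_w - mean_w ** 2
var_zw = second_z * second_w - (mean_z * mean_w) ** 2   # V[Z W] for independent Z, W
var_g0 = (n_c + n_v) * var_zw                            # independent summands
n_M = (n_c + n_v) * mean_z

root = positive_root(n_M, var_g0, var_w)
for z_f in (5, 20, 50, 200):
    print(z_f,
          variance_difference_direct(z_f, n_M, var_g0, var_w),
          variance_difference_lemma(z_f, n_M, var_g0, var_w))
print("variance exceeds the self-background once z_f >", root)
```

Both columns agree, and the sign change at the printed root shows the range of $z_f$ below which the foreign peptide lowers rather than raises the variance.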

Case 2.) Because we consider conditional probabilities and expectations here, these quantities can coincide at most almost surely. We consider now the difference of the conditional expectations as a function of $z_f$, too. In this case the expectation value and the variance depend on $z_f$.

Lemma 2 It holds that $E^{\mathcal{R}}[G_n(z_f)] - E^{\mathcal{R}}[G_n(0)] = z_f (W_f - E[W_1])$ almost surely, and that

$$V^{\mathcal{R}}[G_n(z_f)] - V^{\mathcal{R}}[G_n(0)] = \frac{V^{\mathcal{R}}[G_n(0)]}{n_M^2} \, z_f (z_f - 2 n_M), \tag{2.2}$$

where $V^{\mathcal{R}}[X] := E^{\mathcal{R}}[(X - E^{\mathcal{R}}[X])^2]$.

Note that $V^{\mathcal{R}}[W_1] = 0$ because $W_1$ is measurable w.r.t. $\mathcal{R}$. $E[W_1]$ is with positive probability not equal to $W_f$ because we do not assume the stimulation rates to be distributed according to a Dirac measure. This is reasonable because we know from the biological background that the stimulation rates can vary. Hence, we have here an effect on the average stimulation rate a T-Cell receives which depends on the stimulation rate associated to the foreign peptide. The conditional expectation of $G_n(z_f)$ increases with $z_f$ if $W_f > E[W_1]$. The effect becomes larger for increasing $z_f$. On the other hand, the variance is decreasing for an increasing value of $z_f$ because $z_f$ will never reach $2 n_M$ from the biological point of view. Thus, the situation is really different here. The parabolas describing the difference of the variances are random here.

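Case 3.) Again, the conditional expectations can be at most almost surely equal. Here, the expectation value is almost surely independent of $z_f$ but the variance depends again on $z_f$.

Lemma 3 It holds that $E^{\mathcal{Z}}[G_n(z_f)] - E^{\mathcal{Z}}[G_n(0)] = 0$ almost surely, and that

$$V^{\mathcal{Z}}[G_n(z_f)] - V^{\mathcal{Z}}[G_n(0)] = \left( V[W_1] + \frac{V^{\mathcal{Z}}[G_n(0)]}{n_M^2} \right) z_f \left( z_f - \frac{2 n_M V^{\mathcal{Z}}[G_n(0)]}{V[W_1] n_M^2 + V^{\mathcal{Z}}[G_n(0)]} \right), \tag{2.3}$$

where $V^{\mathcal{Z}}[X] := E^{\mathcal{Z}}[(X - E^{\mathcal{Z}}[X])^2]$.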
Fig. 1: Intervals containing $G_n(z_f)$ with probability and conditional probability 0.99

In this case the conditional expectation of the total stimulation rate is only almost surely independent of $z_f$. We also have random parabolas for the difference of the variances in this case. Note that this scenario is very similar to the one in the annealed case; there is an effect on the level of the variance but not on the level of the expectation.

In Figure 1 intervals are plotted for different values of $z_f$, such that the random variable $G_n(z_f)$ lies in these intervals with probability 0.99, under the tentative assumption that the standardised version of the total stimulation rate, $(G_n(z_f) - E[G_n(z_f)]) / \sqrt{V[G_n(z_f)]}$, is standard normally distributed. In Case 1.) the intervals enlarge for $z_f$ large enough because the variance of $G_n(z_f)$ increases. In Case 2.) the intervals shrink due to the decreasing variance but they move according to the value of the stimulation rate of the foreign peptide. In the presented case $W_f$ is larger than its expectation $E[W_f]$. Thus, the intervals move to the right hand side. Case 3.) is very similar to Case 1.) except for a slight difference concerning the expectation value which is just almost surely independent of $z_f$, and thus slightly varying.
2.2 Large deviations

Encounters of T-Cells and APCs happen permanently in the body, but only very few of them give rise to an immune reaction. Therefore, T-Cell activation must be tuned (by suitable choice of the activation thresholds $g_{act}$) such that activation and the corresponding immune response are rare events. This implies that to compute the activation probabilities, one needs to use large deviation techniques (see for example [6] or [5]). In particular, the computations of means and variances from the previous subsection are insufficient. We are concerned with a family of real-valued random variables $(S_n)_{n \in \mathbb{N}}$ and the probability that $n^{-1} S_n$ exceeds a threshold value $a$, $P(S_n > na)$. A large deviation is a deviation from the expectation value of $S_n$ of the order $n$. If the family of random variables under consideration satisfies a so-called large deviation principle (LDP), the probability for a large deviation event decays exponentially in $n$ with rate $I(a)$. The rate function $I(a)$ is obtained as the limit of the Fenchel-Legendre transformation of the logarithmic moment generating function of $S_n$, to wit $I(a) = \lim_{n \to \infty} I_n(a)$, where

$$I_n(a) = \sup_{\vartheta} \big( a \vartheta - \psi_n(\vartheta) \big) = a \vartheta_n - \psi_n(\vartheta_n), \tag{2.4}$$

and $\psi_n(\vartheta) = \frac{1}{n} \ln E[\exp(\vartheta S_n)]$ and $\vartheta_n$ satisfies

$$\psi_n'(\vartheta_n) = a. \tag{2.5}$$

$\vartheta_n$ is known as the tilting parameter and is used to perform an exponential change of measure in many proofs of theorems in this field. The standard theorems of Cramér and Gärtner-Ellis then state

$$P(S_n > an) = \exp\big( -n I(a) (1 + o(1)) \big). \tag{2.6}$$

As was pointed out in Zint et al. [23], this approximation is not sufficiently precise to calculate the actual probability because of the huge and poorly controlled multiplicative error term $\exp(n \, o(1))$. Fortunately, there are stronger theorems available, known as sharp large deviation results or exact asymptotics. These yield approximations of the form

$$P(S_n > an) = \frac{\exp(-n I_n(a))}{\vartheta_n \sigma_n \sqrt{2 \pi n}} (1 + o(1)), \tag{2.7}$$

with the same notation as before and $\sigma_n^2 = \psi_n''(\vartheta_n)$. The standard theorem for $S_n$ a sum of i.i.d. random variables is due to Bahadur and Rao [2]. The generalization to independent, but not identically distributed random variables, which we need here, is based on results of Chaganty and Sethuraman [3]. We restate these results in Section 3.
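To see what the prefactor in the sharp approximation buys, one can compare it with an exact tail probability in a case where everything is explicit. The sketch below uses a sum of i.i.d. standard exponential variables, which is an illustrative choice and not the model of this paper: there the logarithmic moment generating function is $-\ln(1-\vartheta)$, the tilting parameter is $1 - 1/a$, the rate is $a - 1 - \ln a$, and the exact tail is a Gamma tail. The function names are hypothetical.

```python
import math


def exact_tail_exp_sum(n, a):
    """P(S_n > n*a) for S_n a sum of n i.i.d. Exp(1) variables.

    S_n is Gamma(n, 1), so the tail equals exp(-x) * sum_{k<n} x^k / k!
    with x = n*a; terms are evaluated in log space for stability.
    """
    x = n * a
    return math.fsum(
        math.exp(-x + k * math.log(x) - math.lgamma(k + 1)) for k in range(n)
    )


def sharp_ld_approx(n, a):
    """exp(-n I(a)) / (theta * sigma * sqrt(2 pi n)) for the Exp(1) example."""
    theta = 1.0 - 1.0 / a            # solves psi'(theta) = 1 / (1 - theta) = a
    rate = a - 1.0 - math.log(a)     # I(a) = a * theta - psi(theta)
    sigma = a                        # sigma^2 = psi''(theta) = 1 / (1 - theta)^2
    return math.exp(-n * rate) / (theta * sigma * math.sqrt(2.0 * math.pi * n))


def crude_ld_approx(n, a):
    """exp(-n I(a)) without the prefactor, i.e. the leading order only."""
    return math.exp(-n * (a - 1.0 - math.log(a)))


if __name__ == "__main__":
    for n in (50, 200):
        exact = exact_tail_exp_sum(n, 1.5)
        print(n, exact, sharp_ld_approx(n, 1.5), crude_ld_approx(n, 1.5))
```

In this example the sharp approximation is within a few percent of the exact value already for moderate $n$, whereas the exponential term alone is off by the full polynomial prefactor.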

These techniques will allow us to achieve our goal, namely to check whether there is a threshold value $g_{act}$ such that the probability of activation changes by orders of magnitude with $z_f$. We would like to know if the condition $P(G_n(0) > g_{act}) \ll P(G_n(z_f) > g_{act})$ can be satisfied for physiologically reasonable values of $z_f$. Therefore, we look at the activation probabilities as a function of $g_{act}$. These functions are often called activation curves.

Application to the model of T-Cell activation. We look at an artificial sequence of models which is characterized by an increasing number of peptide types, $n = n_c + n_v + 1$. We assume that there exists $C \in (0, \infty)$ such that $\lim_{n \to \infty} n_c / n_v = C$. This implies that there exist $C_1, C_2 \in (0, \infty)$ such that $\lim_{n \to \infty} n_c / n = C_1$ and $\lim_{n \to \infty} n_v / n = C_2$. We use this condition to ensure some convergence properties, especially for the rate function. But this is also a reasonable assumption from the biological point of view because the ratio of the numbers of constitutive and variable peptide types is constant. The sequence of random variables $S_n$ is given by $G_n(z_f)$ from Definition 1.

In [23] certain distributions for all appearing random variables were used to prove the applicability of Theorem 4 below. Afterwards, the approximations of the probabilities were calculated and compared to simulations. Thereby a separation of the activation curves for different values of $z_f$ was obtained and a close agreement of the approximation with the simulation was observed.

We prove the applicability of Theorem 4 in the Cases 1.), 2.), and 3.) in Section 3 under suitable conditions on the distributions and moment generating functions of the involved random variables. In this section we state just the approximations of the probabilities and the involved rate functions.

Remark 2 To obtain the following approximations of the probabilities we assume the existence of the moment generating and the conditional moment generating functions of $Z^c_1 W_1$, $Z^v_1 W_1$ and $W_1$. This condition is satisfied since we consider bounded random variables.
Case 1.) Let $M_c(\vartheta) = E[e^{\vartheta Z^c_1 W_1}]$, $M_v(\vartheta) = E[e^{\vartheta Z^v_1 W_1}]$ and $M(\vartheta) = E[e^{\vartheta W_1}]$ denote the moment generating functions of $Z^c_1 W_1$, $Z^v_1 W_1$ and $W_1$. Let $g_{act}(n) = an$ for $a > E[G_n(z_f)]/n$. The activation probabilities can be approximated by

$$P(G_n(z_f) > g_{act}(n)) = \frac{\exp\big( -n a \vartheta_n(a, z_f) + n_c \ln M_c(q_n \vartheta_n(a, z_f)) \big)}{\vartheta_n(a, z_f) \, \sigma_n \sqrt{2 \pi n}} \times \exp\big( n_v \ln M_v(q_n \vartheta_n(a, z_f)) + \ln M(z_f \vartheta_n(a, z_f)) \big) (1 + o(1)), \tag{2.8}$$

where $\vartheta_n(a, z_f)$ is chosen such that the argument of the exponential function attains its minimum. We write $\vartheta_n(a, z_f)$ and not just $\vartheta_n$ to visualize the dependence on $a$ and $z_f$. We have proven this approximation in analogy to the proof of Zint, Baake, and den Hollander in [23]. It only requires homogeneous distributions with certain properties in each block. The form of the dependence of the stimulation rates on the dissociation rates is not important. This can be interpreted as a tacit inclusion of competition of the peptides, association rates, loading fluctuations and similar aspects.

The interesting situation is when $z_f$ becomes large, but remains small compared to $n$. In that case, the infimum will be attained for $\vartheta_n \sim a$, and since the law of $W_1$ is assumed to have bounded support, there will be a constant $L$ such that $\frac{1}{\vartheta} \ln M(z_f \vartheta) \sim L z_f$ for $\vartheta z_f$ large enough. For $z_f \gg \sqrt{n}$ we may apply this approximation since we know from the applicability of Theorem 4 that $\vartheta_n \sqrt{n} \to \infty$. A simple computation then shows that

$$\frac{P(G_n(z_f) > g_{act}(n))}{P(G_n(0) > g_{act}(n))} \sim \exp\left( \vartheta_n(a, 0) \, z_f \left( L - \frac{a n}{n_M} \right) \right), \tag{2.9}$$

where $\vartheta_n(a, 0)$ is the solution of Equation 2.5 for $z_f = 0$. This implies that if $a$ is chosen sufficiently small, the activation probability increases exponentially with $z_f$, as desired. To obtain Equation 2.9 we used two Taylor approximations: first we expanded $\vartheta_n(a, z_f)$ in $\vartheta_n(a, 0)$ and after plugging in this term in the rate function we expanded the resulting expression. This way it is possible to recover the probability of activation for the self-background and calculate the ratio of interest as stated by Equation 2.9.
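The bounded-support approximation of the logarithmic moment generating function used above can be made concrete in a toy case. The sketch below assumes, for illustration only, that the stimulation rate is uniform on (0, 1), so that its essential supremum is 1 and $M(s) = (e^s - 1)/s$; it then shows how quickly $\ln M(s)/s$ approaches that supremum as $s = z_f \vartheta$ grows. The function name is hypothetical.

```python
import math


def log_mgf_uniform(s):
    """ln E[exp(s * W)] for W ~ Uniform(0, 1) and s > 0.

    Uses ln((e^s - 1) / s) = s + ln(1 - e^{-s}) - ln(s) to avoid overflow.
    """
    return s + math.log1p(-math.exp(-s)) - math.log(s)


if __name__ == "__main__":
    # ln M(s) / s should approach the upper end of the support (here 1).
    for s in (1.0, 10.0, 100.0, 1000.0):
        print(s, log_mgf_uniform(s) / s)
```

The convergence is only logarithmic in $s$, which is one way to see why the approximation is reserved for the regime in which $z_f \vartheta$ is large.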

Remark 3 The fact that if $a$ is too big, the activation probability drops as $z_f$ increases has a simple intuitive explanation: for very large $a$, the contribution of the foreign peptides is limited by the maximal value of $W_f$, whereas the reduction of the contribution of the other peptides makes it more unlikely to achieve an activation by a random fluctuation.



Case 2.) An upper index $\mathcal{R}$ on any previously defined object should signify the same object conditioned on the $\sigma$-algebra $\mathcal{R}$. We consider the conditional moment generating functions

$$M^{\mathcal{R}}_{\gamma, j}(\vartheta) = E^{\mathcal{R}}\big[ e^{\vartheta Z^{\gamma}_j W_j} \big] = \int \exp(\vartheta Z^{\gamma}_j W_j) \, dP_{Z^{\gamma}}, \quad \gamma \in \{c, v\}, \tag{2.10}$$

where $P_{Z^{\gamma}}$ denotes the measure corresponding to $Z^{\gamma}_j$, $\gamma \in \{c, v\}$. Thus, the moment generating functions are random variables themselves. Because the numbers of copies, $Z^c_j$ and $Z^v_j$, are independent of $\mathcal{R}$ and the stimulation rates are measurable w.r.t. $\mathcal{R}$, these moment generating functions are again i.i.d. random variables in each block. Due to the measurability of $W_f$ w.r.t. $\mathcal{R}$ we have $M^{\mathcal{R}}_f(\vartheta) = \exp(\vartheta z_f W_f)$. The resulting rate function, $I^{\mathcal{R}}(a, z_f)$, is also random. We need to apply a law of large numbers in the proof of the approximation of the probabilities in this case. Thus, $\ln(M^{\mathcal{R}}_{\gamma, 1}(\vartheta)) \in L^1$ for each $\vartheta > 0$ and $\gamma \in \{c, v\}$ is an important ingredient for the proof. Using this fact we can also establish convergence of the rate function according to a strong law of large numbers. But this convergence is not good enough for our purpose because the rate function and thus also the error term arising from the law of large numbers are scaled with a factor $n$ in the large deviation approximation of the probabilities. Therefore, we have to prove a functional central limit theorem for a part of the rate function and we have to take into account the term which emerges from it. Thereby we obtain a process, $Z_n$, which converges weakly to a Gaussian process.
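Let

$$g_n(\vartheta) = \frac{n_c}{n} E\Big[ \ln E^{\mathcal{R}}\big[ e^{\vartheta Z^c_1 W_1} \big] \Big] + \frac{n_v}{n} E\Big[ \ln E^{\mathcal{R}}\big[ e^{\vartheta Z^v_1 W_1} \big] \Big] \tag{2.11}$$

and $\vartheta^{\mathcal{R}}_0(a, z_f)$ be defined as the solution of

$$a = \frac{z_f W_f}{n} + q_n \, g_n'(q_n \vartheta). \tag{2.12}$$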
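The (random) function $I^{\mathcal{R}}_0(a, z_f) = a \vartheta^{\mathcal{R}}_0(a, z_f) - g_n(q_n \vartheta^{\mathcal{R}}_0(a, z_f))$ converges to a function $I_0(a)$. Let

$$Z_n(\vartheta) = \frac{1}{\sqrt{n}} \left[ \sum_{j=1}^{n_c} \Big( \ln M^{\mathcal{R}}_{c, j}(\vartheta) - E\big[ \ln M^{\mathcal{R}}_{c, j}(\vartheta) \big] \Big) + \sum_{j=n_c+1}^{n_c+n_v} \Big( \ln M^{\mathcal{R}}_{v, j}(\vartheta) - E\big[ \ln M^{\mathcal{R}}_{v, j}(\vartheta) \big] \Big) \right]. \tag{2.13}$$

With this notation the activation probabilities can be approximated almost surely according to

$$P^{\mathcal{R}}(G_n(z_f) > g_{act}(n)) = \frac{\exp\big( -n I^{\mathcal{R}}_0(a, z_f) + \sqrt{n} Z_n(q_n \vartheta^{\mathcal{R}}_0(a, z_f)) + z_f W_f \vartheta^{\mathcal{R}}_0(a, z_f) + n R_n \big)}{\sigma_n \vartheta_n \sqrt{2 \pi n}} (1 + o(1)), \tag{2.14}$$

where $R_n \in O(1/n)$. In Section 3 we prove the joint weak convergence of the process $Z_n(q_n \vartheta^{\mathcal{R}}_0(a, z_f))$ and its derivatives, which establishes that the expression for the probabilities is well-behaved.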

As in Case 1.), the interesting situation is when $z_f$ becomes large but $z_f / n$ is small. A computation similar to that in Case 1.) then shows that

$$\frac{P^{\mathcal{R}}(G_n(z_f) > g_{act}(n))}{P^{\mathcal{R}}(G_n(0) > g_{act}(n))} \sim \exp\left( \vartheta^{\mathcal{R}}_0(a, 0) \, z_f \left( W_f - \frac{a n}{n_M} \right) \right). \tag{2.15}$$

This shows that for a given choice of $a$, the activation probability increases exponentially with $z_f$ only if $W_f$ is large enough (depending on the choice of $a$). That is, only the presence of foreign peptides which react strongly with the particular T-Cell will lead to an increased activation frequency. This implies a certain degree of specificity. Although we have seen in Case 1.) that the overall probability of T-Cell activation increases with $z_f$, this is not true for any T-Cell type but just for those which are equipped with a large enough value of $W_f$.

Note that the rate function in this case is random, that is the activation probabilities fluctuate from T-Cell to T-Cell by a factor of order $\exp(\sqrt{n} Z)$, where $Z$ is random. This implies that, for the modulation of the activation probabilities due to foreign peptides to exceed these random fluctuations significantly, one should have that $z_f \gg \sqrt{n}$. This appears to impose a limitation on the sensitivity level for the recognition of foreign peptides.



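Case 3.) An upper index $\mathcal{Z}$ denotes the objects conditioned on this $\sigma$-algebra. $\vartheta^{\mathcal{Z}}(a, z_f)$ is the solution of

$$a = \frac{d}{d\vartheta} g_n(q_n \vartheta) + \frac{d}{d\vartheta} \frac{1}{n} \ln E\big[ e^{\vartheta z_f W_f} \big], \tag{2.16}$$

where $g_n(\vartheta) = \frac{n_c}{n} E\big[ \ln E^{\mathcal{Z}}[ e^{\vartheta Z^c_1 W_1} ] \big] + \frac{n_v}{n} E\big[ \ln E^{\mathcal{Z}}[ e^{\vartheta Z^v_1 W_1} ] \big]$. $Z_n(\vartheta)$ is defined by

$$Z_n(\vartheta) = \frac{1}{\sqrt{n}} \left[ \sum_{j=1}^{n_c} \Big( \ln M^{\mathcal{Z}}_{c, j}(\vartheta) - E\big[ \ln M^{\mathcal{Z}}_{c, j}(\vartheta) \big] \Big) + \sum_{j=n_c+1}^{n_c+n_v} \Big( \ln M^{\mathcal{Z}}_{v, j}(\vartheta) - E\big[ \ln M^{\mathcal{Z}}_{v, j}(\vartheta) \big] \Big) \right] \tag{2.17}$$

and $Z_n(q_n \vartheta^{\mathcal{Z}}(a, z_f))$ converges weakly to a Gaussian process. $I^{\mathcal{Z}}(a, z_f) = a \vartheta^{\mathcal{Z}}(a, z_f) - g_n(q_n \vartheta^{\mathcal{Z}}(a, z_f))$ converges to a function $I(a)$.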

The notations and the proof of the result are quite similar to Case 2.) but we can recognize some struc- 
tural differences in the results here. We have seen these differences between the two conditional scenarios 
already in the analysis of the variances and the expectation values. Here, the probability of activation can be 
approximated by 



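$$\mathbb{P}^{\mathcal{Z}}(G_n(z_f) > g_{act}(n)) = \frac{\exp\Big(-nI^n(a,z_f) + \sqrt{n}\,Z_n(q_n\vartheta^n(a,z_f)) + \ln\mathbb{E}\big[e^{\vartheta^n(a,z_f)z_fW_f}\big] + nR_n\Big)}{\vartheta^n(a,z_f)\,\sigma_n\sqrt{2\pi n}}\,(1+o(1)), \tag{2.18}$$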



where $R_n \in O(\frac{1}{n})$. The rate function again contains a term which depends on the foreign peptide, namely
$\ln\mathbb{E}[\exp(\vartheta^n(a, z_f)z_fW_f)]$. But in contrast to Case 2.) this term is deterministic, and the only randomness
in the rate function lies in the fluctuation term which arises from conditioning. Thus, we are again dealing
with a random rate function, but this function does not vary from experiment to experiment by a term which
is scaled with $z_f$.

The interesting situation is again when $z_f$ becomes large, but remains small compared to $n$. As in Case
1.) there will be a constant $L$ such that $\frac{1}{\vartheta}\ln M(z_f\vartheta) \approx Lz_f$ for $\vartheta z_f$ large enough. A simple computation
then shows that

P-(G„M>,..H)_^^ /~„^^^^^^ /^_MV (2.19) 



''iGniO)>gact{n)) "V ^ ' ' ■* V riM 



where $\vartheta^n(a, 0)$ is the solution of Equation 2.16 for $z_f = 0$. As in Case 2.), there appears a fluctuation term
$\sqrt{n}Z$. Therefore, we again need $z_f \gg \sqrt{n}$ such that the impact of this fluctuation term is not too big. This
order of $z_f$ again ensures that we may use the approximation $\frac{1}{\vartheta}\ln M(z_f\vartheta) \approx Lz_f$.

Remark 4 During an infection the body is flooded with the invader. Therefore, a significant fraction of
the peptides presented on the APC belongs to the foreign invader and it is reasonable to consider
the regime $z_f \gg \sqrt{n}$. A clear indication that a sufficiently high presentation level of the targeted
peptides is needed has been established by Landsberg et al. [TU] in the context of T-cell therapy
of melanomas. Our work is based on the assumption $z_f \ll n$, although parts of the results in Case
2.) are also valid for $z_f \sim n$. If this is the regime of interest, we suggest using a convolution of the
distribution of the foreign stimulation rate and the distribution of the part of $G_n$ belonging to the self
background, since then the influence of the summand $z_fW_f$ is very large and this summand cannot
be treated like the other summands. That is, one should consider $\mathbb{P}(G_n(z_f) > an) = \mathbb{P}(q_nG_n(0) >
an - z_fW_f) = \int \mathbb{P}(n^{-1}G_n(0) > q_n^{-1}(a - n^{-1}z_fW_f) \mid W_f)\,d\mathbb{P}_{W_f}$ and approximate the probability in
the integral suitably, depending on the value of $q_n^{-1}(a - n^{-1}z_fW_f)$.
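
The conditioning step in Remark 4 can be sketched numerically. The following is a minimal illustration and not the authors' procedure: it assumes a discrete distribution for $W_f$ and uses a normal tail as a purely hypothetical stand-in for the self-background probability. The names `normal_tail` and `activation_probability`, the value of `q_n` and all numbers are assumptions made for the example.

```python
import math


def normal_tail(x, mean, std):
    """P(X > x) for a normal X; a hypothetical stand-in for the background tail."""
    return 0.5 * math.erfc((x - mean) / (std * math.sqrt(2.0)))


def activation_probability(a, n, z_f, q_n, wf_values, wf_probs, background_tail):
    """Sum over w of P(G_n(0) > (a*n - z_f*w) / q_n) * P(W_f = w)."""
    total = 0.0
    for w, p in zip(wf_values, wf_probs):
        threshold = (a * n - z_f * w) / q_n   # level the self background must exceed
        total += p * background_tail(threshold)
    return total


# Illustrative numbers only: G_n(0) ~ Normal(n, sqrt(n)), two-point law for W_f.
n = 1000
tail = lambda x: normal_tail(x, mean=float(n), std=math.sqrt(n))
p_without = activation_probability(1.2, n, 0, 1.0, [0.5, 2.0], [0.9, 0.1], tail)
p_with = activation_probability(1.2, n, 300, 1.0, [0.5, 2.0], [0.9, 0.1], tail)
print(p_without, p_with)
```

In this toy setting almost all of the activation probability for $z_f \sim n$ comes from the rare large value of $W_f$, which is why the foreign summand is handled by conditioning rather than inside the large deviation estimate.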



3 Precise formulation of the results and proofs 

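To state the central large deviation result proven by Chaganty and Sethuraman in [3] we introduce some
notation. Let $\{S_n\}_{n\in\mathbb{N}}$ denote a sequence of real-valued random variables with moment generating functions
$\Phi_n(\vartheta) = \mathbb{E}[\exp(\vartheta S_n)]$, $\vartheta \in \mathbb{R}$, and let $\Psi_n$ be defined by $\Psi_n(\vartheta) = \frac{1}{n}\ln\Phi_n(\vartheta)$.

Assumption 3 There exist $\vartheta^* \in (0, \infty)$ and $\beta < \infty$ such that

$|\Psi_n(\vartheta)| < \beta$ for all $\vartheta \in B_{\vartheta^*} = \{\vartheta \in \mathbb{C} : |\vartheta| < \vartheta^*\}$ and $n \in \mathbb{N}$.

Notation. Let $(a_n)_{n\in\mathbb{N}}$ be a bounded real valued sequence such that the equation

$$a_n = \Psi_n'(\vartheta) \tag{3.1}$$

has a solution $\vartheta_n \in (0, \vartheta^{**})$ with $\vartheta^{**} \in (0, \vartheta^*)$ for all $n \in \mathbb{N}$. $\sigma_n^2 = \Psi_n''(\vartheta_n)$ is the variance of the
tilted version of $n^{-1/2}S_n$ and $I_n(a_n) = a_n\vartheta_n - \Psi_n(\vartheta_n)$ is the Fenchel-Legendre transform of $\Psi_n$. We
will abusively refer to this as the rate function.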

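Theorem 4 (Chaganty and Sethuraman [3]) If in the above setting

(i) $\lim_{n\to\infty} \vartheta_n\sqrt{n} = \infty$,
(ii) $\inf_{n\in\mathbb{N}} \sigma_n^2 > 0$ and
(iii) $\lim_{n\to\infty} \sqrt{n}\,\sup_{\delta_1 \le |t| \le \delta_2\vartheta_n} \left|\frac{\Phi_n(\vartheta_n + it)}{\Phi_n(\vartheta_n)}\right| = 0 \quad \forall\, 0 < \delta_1 < \delta_2 < \infty$,

then

$$\mathbb{P}(S_n > na_n) = \frac{e^{-nI_n(a_n)}}{\vartheta_n\sigma_n\sqrt{2\pi n}}\,(1 + o(1)), \quad n \to \infty. \tag{3.2}$$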
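
To see what the approximation in (3.2) looks like in a case where everything is explicit, the sketch below compares it with an exact tail. It is only an illustration under assumptions that are not part of the model above: $S_n$ is taken to be a sum of $n$ i.i.d. Exp(1) variables, so that $\Psi_n(\vartheta) = -\ln(1-\vartheta)$ for $\vartheta < 1$, $\vartheta_n = 1 - 1/a$, $\sigma_n = a$ and $I_n(a) = a - 1 - \ln a$. The function names are hypothetical.

```python
import math


def sharp_ld_approx(n, a):
    """Approximation (3.2) for S_n a sum of n i.i.d. Exp(1) variables, a > 1."""
    theta = 1.0 - 1.0 / a            # solves Psi'(theta) = 1 / (1 - theta) = a
    sigma = a                        # sigma^2 = Psi''(theta) = 1 / (1 - theta)^2
    rate = a - 1.0 - math.log(a)     # I(a) = a * theta - Psi(theta)
    return math.exp(-n * rate) / (theta * sigma * math.sqrt(2.0 * math.pi * n))


def exact_tail(n, a):
    """Exact P(S_n > n a): S_n ~ Gamma(n, 1), tail = e^{-x} * sum_{k<n} x^k / k!."""
    x = n * a
    return sum(math.exp(-x + k * math.log(x) - math.lgamma(k + 1)) for k in range(n))


for n in (200, 800):
    print(n, exact_tail(n, 1.5) / sharp_ld_approx(n, 1.5))
```

The printed ratios approach 1 as $n$ grows, which is the content of the factor $(1 + o(1))$.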



We give here the precise conditions that we impose on the distributions of the involved random variables to
ensure the applicability of Theorem 4 with $S_n = G_n(z_f)$ in the three different cases.

Case 1.) This case has been considered in Zint et al. [23]. We state their result under slightly more general
assumptions. Below, the quantities $\Phi_n$, $\Psi_n$ and $\vartheta_n$ are defined as above with $S_n = G_n(z_f)$.

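Theorem 5 Let $(a_n)_{n\in\mathbb{N}}$ be defined by $a_n = a$ and $g_{act}(n) = an$ such that $g_{act}(n) > \mathbb{E}[G_n(z_f)]$ and $a <
\sup_{\vartheta\in\mathbb{R}} \frac{d}{d\vartheta}\Psi_n(\vartheta)$ for all $n \in \mathbb{N}$. Then Theorem 4 is applicable provided $z_f/n \downarrow 0$, the distribution functions
of $Z_1^cW_1$, $Z_1^vW_1$ and $W_1$ are neither lattice valued nor concentrated on one point, and the corresponding
moment generating functions $M_c(\vartheta)$, $M_v(\vartheta)$ and $M(\vartheta)$ are finite for each $\vartheta \in \mathbb{R}$. The rate function is

$$I_n(a, z_f) = a\vartheta_n(a, z_f) - \frac{n_c}{n}\ln M_c(q_n\vartheta_n(a, z_f)) - \frac{n_v}{n}\ln M_v(q_n\vartheta_n(a, z_f)) - \frac{1}{n}\ln M(z_f\vartheta_n(a, z_f)). \tag{3.3}$$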

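Proof The moment generating function of the random variable $G_n$ is given by

$$\Phi_n(\vartheta) = M_c(q_n\vartheta)^{n_c}M_v(q_n\vartheta)^{n_v}M(z_f\vartheta). \tag{3.4}$$

It reduces to $\Phi_n(\vartheta) = M_c(\vartheta)^{n_c}M_v(\vartheta)^{n_v}$ if $z_f = 0$. Assumption 3 is satisfied because the following
holds: For each $x \in \mathbb{R}_+$ and all $\vartheta < x$

$$\Psi_n(\vartheta) < \frac{n_c}{n}\ln M_c(q_nx) + \frac{n_v}{n}\ln M_v(q_nx) + \frac{1}{n}\ln M(z_fx) < \ln M_c(x) + \ln M_v(x) + \ln M(z_fx) = \beta(x) \tag{3.5}$$

because $\Psi_n(\vartheta)$ is strictly increasing and $q_n$, $n_c/n$ and $n_v/n$ are smaller than 1. $\vartheta_n(a, z_f)$ is defined
as the (unique) solution of

$$a = \frac{n_c}{n}\frac{d}{d\vartheta}\ln M_c(q_n\vartheta) + \frac{n_v}{n}\frac{d}{d\vartheta}\ln M_v(q_n\vartheta) + \frac{1}{n}\frac{d}{d\vartheta}\ln M(z_f\vartheta). \tag{3.6}$$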



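This equation results from Equation 3.1 and the choice $a_n = a$. The solution exists since the function
$\frac{d}{d\vartheta}\Psi_n(\vartheta)$ runs from $\frac{1}{n}\mathbb{E}[G_n(z_f)] = \frac{d}{d\vartheta}\Psi_n(\vartheta)|_{\vartheta=0}$ to $\sup_{\vartheta\in\mathbb{R}} \frac{d}{d\vartheta}\Psi_n(\vartheta)$ and $a$ lies in between these
values. It is unique because $\frac{d}{d\vartheta}\Psi_n(\vartheta)$ is strictly increasing. Because $\frac{n_c}{n} \to C_1$, $\frac{n_v}{n} \to C_2$
and $q_n \to 1$, Equation 3.6 converges. The limit equation is $a = C_1\frac{d}{d\vartheta}\ln M_c(\vartheta) + C_2\frac{d}{d\vartheta}\ln M_v(\vartheta)$. Thus,
there exists $C \in (0, \infty)$ such that $\lim_{n\to\infty} \vartheta_n = C$. $C$ is strictly positive because $a > \frac{d}{d\vartheta}\Psi_n(\vartheta)|_{\vartheta=0}$.
Consequently, Condition (i) of Theorem 4 is satisfied.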
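We define

$$\sigma_n^2 = \left.\left(\frac{n_c}{n}\frac{d^2}{d\vartheta^2}\ln M_c(q_n\vartheta) + \frac{n_v}{n}\frac{d^2}{d\vartheta^2}\ln M_v(q_n\vartheta) + \frac{1}{n}\frac{d^2}{d\vartheta^2}\ln M(z_f\vartheta)\right)\right|_{\vartheta=\vartheta_n(a,z_f)}. \tag{3.7}$$

This equation converges as the previous one and the second derivatives of $\ln M_\gamma(q_n\vartheta)$, $\gamma \in \{c, v\}$,
are positive due to the strict convexity of these functions. Thus, Condition (ii) of Theorem 4 is
satisfied, too.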
We define

$$\varphi_\gamma^n(t) = \frac{M_\gamma(q_n(\vartheta_n(a,z_f)+it))}{M_\gamma(q_n\vartheta_n(a,z_f))},\ \gamma\in\{c,v\},\quad\text{and}\quad \varphi^n(t) = \frac{M(z_f(\vartheta_n(a,z_f)+it))}{M(z_f\vartheta_n(a,z_f))}. \tag{3.8}$$

These are the characteristic functions of the tilted random variables. The distribution functions
corresponding to these characteristic functions are also neither lattice valued nor concentrated on
one point. Because $\vartheta_n(a, z_f) \to C$ and $q_n \to 1$, there exist $\varepsilon > 0$ and $n_0 < \infty$ for each $t \ne 0$ such
that for all $n > n_0$

$$|\varphi_\gamma^n(t)| < 1-\varepsilon,\ \gamma\in\{c,v\},\quad\text{and}\quad |\varphi^n(t)| \le 1.$$

We obtain

$$\left|\frac{\Phi_n(\vartheta_n(a,z_f)+it)}{\Phi_n(\vartheta_n(a,z_f))}\right| = \left|\frac{M_c(q_n(\vartheta_n(a,z_f)+it))^{n_c}M_v(q_n(\vartheta_n(a,z_f)+it))^{n_v}M(z_f(\vartheta_n(a,z_f)+it))}{M_c(q_n\vartheta_n(a,z_f))^{n_c}M_v(q_n\vartheta_n(a,z_f))^{n_v}M(z_f\vartheta_n(a,z_f))}\right| = \big|(\varphi_c^n(t))^{n_c}(\varphi_v^n(t))^{n_v}\varphi^n(t)\big| < (1-\varepsilon)^{n_c+n_v}. \tag{3.9}$$

It remains to consider the supremum over the values of $t$ in Condition (iii). Since $\vartheta_n(a, z_f)$ converges,
the supremum is taken over a compact set. Thus, this function attains a maximum on this interval
and this can be bounded according to Equation 3.9. Therefore, Condition (iii) of Theorem 4 is
satisfied and Theorem 4 is applicable. □
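
The existence and convergence argument for $\vartheta_n(a, z_f)$ in the proof above can be illustrated numerically. The sketch below solves Equation 3.6 by bisection, which is justified by the strict monotonicity of $\frac{d}{d\vartheta}\Psi_n$. Everything concrete in it is an assumption made for illustration only: the three moment generating functions are those of uniform laws on $(0, b)$, the proportions are $n_c = 0.3n$, $n_v = 0.7n$, $z_f = \sqrt{n}$, and $q_n = 1 - z_f/n$ is a hypothetical choice of the scaling factor.

```python
import math


def dlog_mgf_uniform(theta, b):
    """d/dtheta ln M(theta), with M the MGF of a Uniform(0, b) variable."""
    x = b * theta
    if abs(x) < 1e-4:
        return b / 2.0 + b * x / 12.0        # series expansion near theta = 0
    return b / (-math.expm1(-x)) - 1.0 / theta


def psi_prime(theta, n, n_c, n_v, z_f, q_n, b_c=1.0, b_v=2.0, b_f=3.0):
    """Right-hand side of Equation 3.6 for the toy uniform laws."""
    return (n_c / n * q_n * dlog_mgf_uniform(q_n * theta, b_c)
            + n_v / n * q_n * dlog_mgf_uniform(q_n * theta, b_v)
            + z_f / n * dlog_mgf_uniform(z_f * theta, b_f))


def solve_theta(a, lo=0.0, hi=50.0, tol=1e-12, **params):
    """Bisection for psi_prime(theta) = a; psi_prime is strictly increasing."""
    if not psi_prime(lo, **params) < a < psi_prime(hi, **params):
        raise ValueError("a must lie strictly between Psi'(lo) and Psi'(hi)")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if psi_prime(mid, **params) < a:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def toy_theta(a, n):
    """theta_n for n_c = 0.3 n, n_v = 0.7 n, z_f = sqrt(n), q_n = 1 - z_f / n."""
    z_f = math.sqrt(n)
    return solve_theta(a, n=n, n_c=0.3 * n, n_v=0.7 * n, z_f=z_f, q_n=1.0 - z_f / n)


# Limit equation a = C_1 (ln M_c)'(theta) + C_2 (ln M_v)'(theta), C_1 = 0.3, C_2 = 0.7.
theta_limit = solve_theta(1.2, n=1.0, n_c=0.3, n_v=0.7, z_f=0.0, q_n=1.0)
for n in (10**3, 10**4, 10**5):
    print(n, toy_theta(1.2, n), theta_limit)
```

In this example the solutions stay strictly positive and move towards the solution of the limit equation as $z_f/n \to 0$, in line with Condition (i).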

Remark 5 The case $z_f \sim n$ requires a special treatment. This is, however, best relegated to the
following Case 2.).

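Case 2.) We denote by $\mathbb{E}_{\mathbb{P}^{N_1N_2}}[\cdot]$ the expectation w.r.t. the measure $\mathbb{P}^{N_1N_2}$ which is the joint distribution
of all the dissociation rates. We would like to show that the conditions of Theorem 4 are almost surely
satisfied. As already mentioned, it is important that $\ln(M^{\mathcal{W}}_{\gamma,j}(\vartheta)) \in L^1(\mathbb{P}^{N_1N_2})$, where $\gamma \in \{c, v\}$. Under the
assumption $M_\gamma(\vartheta) < \infty$ for each $\vartheta \in (0, \vartheta^{**})$ we have that $\mathbb{E}^{\mathcal{W}}[\exp(\vartheta Z_1^\gamma W_1)] \in L^1(\mathbb{P}^{N_1N_2})$. Combined
with

$$0 \le \ln(\mathbb{E}^{\mathcal{W}}[\exp(\vartheta Z_1^\gamma W_1)]) \le \mathbb{E}^{\mathcal{W}}[\exp(\vartheta Z_1^\gamma W_1)], \quad \text{for } \vartheta \ge 0, \tag{3.10}$$

this yields

$$\ln(M^{\mathcal{W}}_{\gamma,1}(\vartheta)) = \ln(\mathbb{E}^{\mathcal{W}}[\exp(\vartheta Z_1^\gamma W_1)]) \in L^1(\mathbb{P}^{N_1N_2}), \tag{3.11}$$

where $\gamma \in \{c, v\}$. Below we denote by $\Psi_n^{\mathcal{W}}$, $\vartheta_n^{\mathcal{W}}(a, z_f) = \vartheta_n(a, z_f)$ the analogues of the quantities $\Psi_n$,
$\vartheta_n(a, z_f)$ under the conditional expectations $\mathbb{E}^{\mathcal{W}}$. For notational simplicity we drop that superscript on
$\vartheta_n(a, z_f)$, but it is important to keep in mind that this is now a random variable, too.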

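Theorem 6 Let $(a_n)_{n\in\mathbb{N}}$ be defined by $a_n = a$ and $g_{act}(n) = an$ such that $g_{act}(n) > \mathbb{E}^{\mathcal{W}}[G_n(z_f)]$ and
$a < \sup_{\vartheta\in\mathbb{R}} \frac{d}{d\vartheta}\Psi_n^{\mathcal{W}}(\vartheta)$ for all $n \in \mathbb{N}$. Then Theorem 4 is almost surely applicable if the distribution functions
of the stimulation rates are neither lattice valued nor concentrated on one point and the moment generating
functions $M^{\mathcal{W}}_{c,j}(\vartheta)$, $M^{\mathcal{W}}_{v,j}(\vartheta)$ and $M^{\mathcal{W}}(\vartheta)$ as well as $M_c(\vartheta)$, $M_v(\vartheta)$ and $M(\vartheta)$ are finite for each $\vartheta \in \mathbb{R}$.
Then the rate function is

$$I_n^{\mathcal{W}}(a, z_f) = \left(a - \frac{z_f}{n}W_f\right)\vartheta_n(a, z_f) - \frac{1}{n}\left(\sum_{j=1}^{n_c}\ln M^{\mathcal{W}}_{c,j}(q_n\vartheta_n(a, z_f)) + \sum_{j=n_c+1}^{n_c+n_v}\ln M^{\mathcal{W}}_{v,j}(q_n\vartheta_n(a, z_f))\right). \tag{3.12}$$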

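Proof Due to the monotonicity of the logarithmic moment generating function for each realization
of the stimulation rates and the boundedness of all involved random variables we can again find
$\beta(x)$ such that $\Psi_n^{\mathcal{W}}(\vartheta) < \beta(x)$ for all $\vartheta < x$. Thus, Assumption 3 is satisfied. Recall that $\vartheta_n(a, z_f)$
is defined as the solution of the equation

$$a = \frac{1}{n}\sum_{j=1}^{n_c}\frac{d}{d\vartheta}\ln M^{\mathcal{W}}_{c,j}(q_n\vartheta) + \frac{1}{n}\sum_{j=n_c+1}^{n_c+n_v}\frac{d}{d\vartheta}\ln M^{\mathcal{W}}_{v,j}(q_n\vartheta) + \frac{z_f}{n}W_f. \tag{3.13}$$

The solution $\vartheta_n(a, z_f)$ exists due to the choice of $a$ and is unique due to the strict convexity of $\Psi_n^{\mathcal{W}}$.
For $\vartheta > 0$ we have, by the law of large numbers,

$$\lim_{n\to\infty}\frac{1}{n}\left(\sum_{j=1}^{n_c}\ln M^{\mathcal{W}}_{c,j}(q_n\vartheta) + \sum_{j=n_c+1}^{n_c+n_v}\ln M^{\mathcal{W}}_{v,j}(q_n\vartheta)\right) = C_1\mathbb{E}_{\mathbb{P}^{N_1N_2}}\big[\ln M^{\mathcal{W}}_{c,1}(\vartheta)\big] + C_2\mathbb{E}_{\mathbb{P}^{N_1N_2}}\big[\ln M^{\mathcal{W}}_{v,1}(\vartheta)\big], \quad \mathbb{P}^{N_1N_2}\text{-a.s.} \tag{3.14}$$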

Since the derivatives of the summands satisfy the bounds 



-InE [eM^q^ZjW,)] ^ E7^[exp(^,„z7W^l)] 



^ E^[eM^qnZ-lW,)] - '^"^^^1 ' ^'-''^ 

where $z^{\gamma,\max}$ denotes the maximal value of $Z_1^\gamma$, $\gamma \in \{c, v\}$. Therefore they are integrable and
hence the limit of the derivatives on the left-hand side of Equation 3.14 exists and is equal to
the derivative of the right-hand side. If either $\lim_{n\to\infty} z_f/n = 0$, or $\lim_{n\to\infty} z_f/n = C > 0$, the
equations determining $\vartheta_n(a, z_f)$ converge almost surely and therefore so does the solution $\vartheta_n(a, z_f)$.
Thus, Condition (i) of Theorem 4 is again satisfied.
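
We have $\left.\left(\frac{d^2}{d\vartheta^2}\Psi_n^{\mathcal{W}}(\vartheta)\right)\right|_{\vartheta=\vartheta_n(a,z_f)} > 0$ for each $n$ due to the strict convexity of $\Psi_n^{\mathcal{W}}$. So, it remains to
check whether this holds true in the limit $n \to \infty$. $\lim_{n\to\infty}\frac{d^2}{d\vartheta^2}\Psi_n^{\mathcal{W}}(\vartheta)$ exists because the summands
of the derivative are again bounded and therefore integrable, since

$$0 \le \frac{d^2}{d\vartheta^2}\ln\mathbb{E}^{\mathcal{W}}\big[\exp(\vartheta q_nZ_1^\gamma W_1)\big] = \frac{q_n^2W_1^2\,\mathbb{E}^{\mathcal{W}}\big[(Z_1^\gamma)^2e^{\vartheta q_nZ_1^\gamma W_1}\big]\,\mathbb{E}^{\mathcal{W}}\big[e^{\vartheta q_nZ_1^\gamma W_1}\big] - \mathbb{E}^{\mathcal{W}}\big[q_nW_1Z_1^\gamma e^{\vartheta q_nZ_1^\gamma W_1}\big]^2}{\mathbb{E}^{\mathcal{W}}\big[e^{\vartheta q_nZ_1^\gamma W_1}\big]^2} \le \frac{q_n^2W_1^2(z^{\gamma,\max})^2\,\mathbb{E}^{\mathcal{W}}\big[e^{\vartheta q_nZ_1^\gamma W_1}\big]^2}{\mathbb{E}^{\mathcal{W}}\big[e^{\vartheta q_nZ_1^\gamma W_1}\big]^2} = q_n^2W_1^2(z^{\gamma,\max})^2. \tag{3.16}$$

Thus, it is again allowed to interchange limit and derivative. Moreover, we have that

$$\frac{d^2}{d\vartheta^2}\left(C_1\mathbb{E}_{\mathbb{P}^{N_1N_2}}\big[\ln M^{\mathcal{W}}_{c,1}(\vartheta)\big]\right) = C_1\mathbb{E}_{\mathbb{P}^{N_1N_2}}\left[\frac{d^2}{d\vartheta^2}\ln M^{\mathcal{W}}_{c,1}(\vartheta)\right] > 0 \tag{3.17}$$

because $\ln M^{\mathcal{W}}_{c,1}(\vartheta)$ is strictly convex. Thus, this summand is positive and, analogously, so is the
second one. Therefore, Condition (ii) of Theorem 4 is satisfied.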

Next we check Condition (iii) on the characteristic function. We have to take into account that
$Z_1^c$ and $Z_1^v$ should be lattice valued random variables because they represent numbers of peptides.
The characteristic function is given by



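$$\frac{\Phi_n^{\mathcal{W}}(\vartheta+it)}{\Phi_n^{\mathcal{W}}(\vartheta)} = \frac{\prod_{j=1}^{n_c}M^{\mathcal{W}}_{c,j}(q_n(\vartheta+it))\prod_{j=n_c+1}^{n_c+n_v}M^{\mathcal{W}}_{v,j}(q_n(\vartheta+it))\,M^{\mathcal{W}}(z_f(\vartheta+it))}{\prod_{j=1}^{n_c}M^{\mathcal{W}}_{c,j}(q_n\vartheta)\prod_{j=n_c+1}^{n_c+n_v}M^{\mathcal{W}}_{v,j}(q_n\vartheta)\,M^{\mathcal{W}}(z_f\vartheta)}. \tag{3.18}$$

We can rewrite 3.18 as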



M5(g„(tf+»t)) 
Mf,.(<?„i?) 



:exp Uc ( EplViNa 



In 



ACi(9„(^+Jt)) 



+ n„ EpNiJVa 



In 



M;^i(g„(i?+»t)) 



This expression can be bounded from above by 



o(l) 



M5(g„(tf+»t)) 



M'^(z/(i>+it)) 



In 



M'^(gj(ij+it)) 



(3.19) 



exp[ne(ln(l-£)P^i^^ 



+ n„ ln(l-£)7' 



WiAf2 






*f^l («"'') 



< l-£ 



<l-e] +e 



For given e > 0, the probabilities 



7N1N2 






< l-e 



and P 



JViWa 






< l-£ 



(3.20) 



(3.21) 



are strictly positive, uniformly in $n$ for $n$ large, due to the assumptions on the distribution of the
stimulation rates. Therefore, there exists $\delta > 0$ such that for all $n$ large enough, 3.20 is bounded
from above by

exp((?i-l)(e-(5)). (3.22) 

Since $\varepsilon$ can be made arbitrarily small if $n$ is large enough, $\delta - \varepsilon > 0$ for such $n$, and so this expression
tends to zero with $n$ exponentially fast. It is again a crucial point that $\vartheta_n(a, z_f)$ converges such
that the supremum is taken over a compact set and Condition (iii) is satisfied. □

Remark 6 Note that, in order to check these conditions, it suffices to assume $C_1 + C_2 > 0$. It is not
necessary that both constants are strictly positive.

Investigation of the rate function. We are concerned with the behaviour of the large deviation rate function
and prove a functional central limit theorem with which we can characterize this. $g_n(q_n\vartheta)$ defined by
Equation 2.11 converges to

$$C_1\mathbb{E}\big[\ln\mathbb{E}^{\mathcal{W}}[e^{\vartheta Z_1^cW_1}]\big] + C_2\mathbb{E}\big[\ln\mathbb{E}^{\mathcal{W}}[e^{\vartheta Z_1^vW_1}]\big] =: g(\vartheta). \tag{3.23}$$

In the rate function appears the process $Z_n(q_n\vartheta_0^n(a, z_f))$ which is defined by Equation 2.13. The following
theorem states our result. We use the shorthand notation $M^{\mathcal{W},a}_{\gamma,1} = \ln M^{\mathcal{W}}_{\gamma,1}(\vartheta_0(a))$, where $\vartheta_0(a)$ denotes the
limit of $\vartheta_0^n(a, z_f)$, the solution of Equation 2.12.

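Theorem 7 If there exists a constant $C$ such that $g_n''(q_n\vartheta_0^n(a, z_f)) > C > 0$, the rate function is given by

$$I_n^{\mathcal{W}}(a, z_f) = I_0^n(a, z_f) - \frac{1}{\sqrt{n}}Z_n(q_n\vartheta_0^n(a, z_f)) - \frac{z_f}{n}\vartheta_0^n(a, z_f)W_f + R_n, \tag{3.24}$$

where $Z_n(q_n\vartheta_0^n(a, z_f))$ converges weakly to the Gaussian process $Z_a + \tilde{Z}_a$ and $R_n \in O(\frac{1}{n})$. $Z_a$ and $\tilde{Z}_a$
are both Gaussian processes with expectation functions $\mathbb{E}[Z_a] = 0 = \mathbb{E}[\tilde{Z}_a]$ and covariance functions

$$\mathrm{Cov}(Z_a, Z_{a'}) = C_1\left(\mathbb{E}\big[M^{\mathcal{W},a}_{c,1}M^{\mathcal{W},a'}_{c,1}\big] - \mathbb{E}\big[M^{\mathcal{W},a}_{c,1}\big]\mathbb{E}\big[M^{\mathcal{W},a'}_{c,1}\big]\right) \tag{3.25}$$

and

$$\mathrm{Cov}(\tilde{Z}_a, \tilde{Z}_{a'}) = C_2\left(\mathbb{E}\big[M^{\mathcal{W},a}_{v,1}M^{\mathcal{W},a'}_{v,1}\big] - \mathbb{E}\big[M^{\mathcal{W},a}_{v,1}\big]\mathbb{E}\big[M^{\mathcal{W},a'}_{v,1}\big]\right). \tag{3.26}$$

Remark 7 The remainder term is given by

$$R_n = \frac{\big(Z_n'(q_n\vartheta_0^n(a, z_f))\big)^2}{2n\left(g_n''(q_n\vartheta_0^n(a, z_f)) + \frac{1}{\sqrt{n}}Z_n''(q_n\vartheta_0^n(a, z_f))\right)} + o\left(\frac{1}{n}\right), \tag{3.27}$$

where the appearing process scaled with $n$ converges weakly. Since we consider the regime $z_f \gg \sqrt{n}$,
the term $-\frac{z_f}{n}\vartheta_0^n(a, z_f)W_f$ is of a higher order than the remainder.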

As we already mentioned in Section 2, we need this approximation of the rate function on the level of
the central limit theorem due to the scaling with the factor $n$ in the expression for the probabilities. In order
to prove this result we show weak convergence of the involved random processes and then derive an
expression for the rate function. To establish the weak convergence of $Z_n(q_n\vartheta_0^n(a, z_f))$, $Z_n'(q_n\vartheta_0^n(a, z_f))$ and
$Z_n''(q_n\vartheta_0^n(a, z_f))$ as well as their joint weak convergence as processes on the Wiener space with parameter
$a$, we show convergence of their finite dimensional distributions and tightness. To prove tightness we use the
Kolmogorov-Chentsov criterion from [9]. Adapted to our scenario, we have to check the following conditions:

1. $Z_n(q_n\vartheta_0^n(a, z_f))$ converges in finite dimensional distribution.

2. The family of initial distributions, $Z_n(q_n\vartheta_0^n(\varepsilon, z_f))$, is tight.

3. There exists $C > 0$ independent of $a$ and $n$ such that

$$\mathbb{E}\left[\big(Z_n(q_n\vartheta_0^n(a+h, z_f)) - Z_n(q_n\vartheta_0^n(a, z_f))\big)^2\right] \le C|h|^2. \tag{3.28}$$

Note that Condition 3 is fulfilled if

$$\mathbb{E}\left[\left(\frac{d}{da}Z_n(q_n\vartheta_0^n(a, z_f))\right)^2\right] \le C. \tag{3.29}$$

The same criteria with $Z_n(q_n\vartheta_0^n(a, z_f))$ suitably replaced by the process under consideration can be used to
prove the convergence of these processes.
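
The step from the derivative bound 3.29 to the increment bound 3.28 can be made explicit. The following is a short sketch, assuming that the process is continuously differentiable in $a$ and that the bound in 3.29 holds uniformly in $a$ and $n$; for brevity it writes $Z_n(a)$ for $Z_n(q_n\vartheta_0^n(a, z_f))$ and takes $h > 0$ (the case $h < 0$ is symmetric).

```latex
\begin{align*}
\mathbb{E}\big[(Z_n(a+h)-Z_n(a))^2\big]
&= \mathbb{E}\Big[\Big(\int_a^{a+h}\tfrac{d}{ds}Z_n(s)\,ds\Big)^2\Big] \\
&\le |h| \int_{a}^{a+h} \mathbb{E}\Big[\big(\tfrac{d}{ds}Z_n(s)\big)^2\Big]\,ds
 && \text{(Cauchy--Schwarz)} \\
&\le C\,|h|^2
 && \text{(bound 3.29 under the integral)}
\end{align*}
```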

We can handle the constitutive and the variable part separately. It suffices to check the conditions for the
constitutive part because the sum in the variable part is built analogously. The following results are taken
from [7]. We need this central limit theorem for triangular arrays to check Condition 1.

Definition 2 A row-wise independent $d$-dimensional triangular array scheme is a sequence $(K^n)$ of
elements of $\bar{\mathbb{N}}^* = \mathbb{N} \setminus \{0\} \cup \{\infty\}$ and a sequence of probability spaces $(\Omega^n, \mathcal{F}^n, \mathbb{P}^n)$, each one being
equipped with an independent sequence $(\chi_k^n)_{1\le k\le K^n}$ of $\mathbb{R}^d$-valued random variables.

We restrict the scenario to row-wise independent schemes which satisfy

$$\sum_{1\le k\le K^n}\big|\mathbb{E}[h(\chi_k^n)]\big| < \infty \quad\text{and}\quad \sum_{1\le k\le K^n}\mathbb{E}\big[|\chi_k^n|^2\wedge 1\big] < \infty \tag{3.30}$$

for each $n$, where $h$ is a given truncation function. This condition does not depend on $h \in \mathcal{C}_t^d = \{h : \mathbb{R}^d \to
\mathbb{R}^d$ bounded, compact support, $h(x) = x$ in a neighbourhood of $0\}$.

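Definition 3 A row-wise independent array $(\chi_k^n)$ satisfies the Lindeberg condition if for all $\varepsilon > 0$ we
have

$$\lim_{n\to\infty}\sum_{1\le k\le K^n}\mathbb{E}\big[|\chi_k^n|^2\,\mathbb{1}_{\{|\chi_k^n|>\varepsilon\}}\big] = 0. \tag{3.31}$$

Of course, this implies $\sum_k\mathbb{E}[|\chi_k^n|^2] < \infty$, provided Condition 3.30 is satisfied.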
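
For uniformly bounded arrays the Lindeberg condition is easy to verify, and this is the situation used further below. The sketch assumes, purely for illustration, a row of $n$ entries with the deterministic bound $|\chi_k^n| \le c/\sqrt{n}$; the function name `lindeberg_bound` and the constant `c` are hypothetical. Each summand in 3.31 is then at most $(c^2/n)\,\mathbb{1}_{\{c/\sqrt{n} > \varepsilon\}}$, so the whole sum vanishes as soon as $n \ge (c/\varepsilon)^2$.

```python
import math


def lindeberg_bound(n, c, eps):
    """Upper bound on the Lindeberg sum for n entries with |x| <= c / sqrt(n)."""
    largest = c / math.sqrt(n)          # largest possible modulus of an entry
    if largest > eps:
        return n * largest ** 2         # every indicator may be 1: bound is c^2
    return 0.0                          # no entry can exceed eps: the sum is 0


for n in (10, 399, 401, 10_000):
    print(n, lindeberg_bound(n, c=2.0, eps=0.1))
```

With `c = 2.0` and `eps = 0.1` the bound stays at $c^2 = 4$ for $n < 400$ and is exactly zero afterwards, so the limit in 3.31 is zero.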



Theorem 8 We suppose that the $d$-dimensional row-wise independent array satisfies Condition 3.30 and the
Lindeberg condition, and let $\xi^n = \sum_{1\le k\le K^n}\chi_k^n$. Then

a) If $\mathcal{L}(\xi^n) \to \mu$, then $\mu$ is a Gaussian measure on $\mathbb{R}^d$;

b) in order that $\mathcal{L}(\xi^n) \to \mathcal{N}(b, c)$, the Gaussian measure with mean $b$ and covariance matrix $c$, it is necessary
and sufficient that the following two conditions hold:






<k<K' 



<k<K' 



n.j n.l 

Xk Xk 



where $\chi_k^{n,l}$ denotes the $l$-th component of $\chi_k^n$.

Using this theorem we can prove the following lemmata which we need to prove Theorem 7.

Lemma 4 $Z_n(q_n\vartheta_0^n(a, z_f))$ as a process on the Wiener space with parameter $a$ converges weakly to a
Gaussian process if there exists a constant $C$ such that $g_n''(q_n\vartheta_0^n(a, z_f)) > C > 0$.



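In order to simplify the notation we define

$$Y^n_{a,j} = \ln\mathbb{E}^{\mathcal{W}}\big[e^{q_n\vartheta_0^n(a,z_f)Z_j^cW_j}\big] - \mathbb{E}\Big[\ln\mathbb{E}^{\mathcal{W}}\big[e^{q_n\vartheta_0^n(a,z_f)Z_j^cW_j}\big]\Big]. \tag{3.32}$$

The constitutive part of the process $Z_n(q_n\vartheta_0^n(a, z_f))$ is given by $Z_{n,c}(q_n\vartheta_0^n(a, z_f)) = \frac{1}{\sqrt{n}}\sum_{j=1}^{n_c}Y^n_{a,j}$.
To prove this lemma we have to check Conditions 1, 2 and 3. First we investigate the finite dimensional
distributions of $Z_{n,c}$ in the following Lemma 5. Therefore, let $0 < a_1 < \dots < a_m < \infty$, $a_i \in \mathbb{R}$, $m \in \mathbb{N}$.
We are interested in the limiting behaviour of $\xi^n = \sum_{1\le j\le K^n}\chi_j^n$ with $\chi_j^n = \frac{1}{\sqrt{n}}(Y^n_{a_1,j}, \dots, Y^n_{a_m,j})$ and
$j \in \{1, \dots, n_c\}$.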

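Lemma 5 Under the assumptions of Lemma 4, $\xi^n = \sum_{j=1}^{n_c}\chi_j^n$ converges weakly to a Gaussian vector with
expectation $0$ and covariance matrix defined by

$$C^{kl} = C_1\left(\mathbb{E}\big[M^{\mathcal{W},a_k}_{c,1}M^{\mathcal{W},a_l}_{c,1}\big] - \mathbb{E}\big[M^{\mathcal{W},a_k}_{c,1}\big]\mathbb{E}\big[M^{\mathcal{W},a_l}_{c,1}\big]\right). \tag{3.33}$$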



Proof We show that Theorem [8] is applicable in this case. We have 



E 



lnAf5(q„^^(a„z;))-E 



lnA4'^^.(g„i?S(a„z/)) 



< K^ 

n 



(3.34) 



where $K$ is the global constant bounding each $M^{\mathcal{W}}_{c,j}(\vartheta)$ for $\vartheta \in (0, \vartheta^{**})$, independent of $j$. We have to
check that the Lindeberg condition 3.31 is satisfied. Since $|\chi_j^n| \le 2\sqrt{m}K/\sqrt{n}$, there exists for each $\varepsilon > 0$
an $n_0 \in \mathbb{N}$ such that $|\chi_j^n| < \varepsilon$ for all $n > n_0$. Therefore, each summand is $0$ for $n > n_0$ and thus also
the sum and the limit vanish. Part 1 of Condition 3.30 is satisfied because we consider centered
random variables. Part 2 holds true due to



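$$\sum_{j=1}^{n_c}\mathbb{E}\big[|\chi_j^n|^2\wedge 1\big] \le n_c\max_j|\chi_j^n|^2 \le n_c\,\frac{4mK^2}{n} \le 4mK^2 < \infty \tag{3.35}$$

according to 3.34. There can only appear finitely many summands which are equal to 1. Condition
[β] of Theorem 8 is satisfied because each $\chi_j^n$ has expectation $0$ by construction. Condition
[γ] is satisfied since

$$\sum_{j=1}^{n_c}\mathbb{E}\big[\chi_j^{n,k}\chi_j^{n,l}\big] = \frac{n_c}{n}\Big(\mathbb{E}\big[\ln M^{\mathcal{W}}_{c,1}(q_n\vartheta_0^n(a_k, z_f))\,\ln M^{\mathcal{W}}_{c,1}(q_n\vartheta_0^n(a_l, z_f))\big] - \mathbb{E}\big[\ln M^{\mathcal{W}}_{c,1}(q_n\vartheta_0^n(a_k, z_f))\big]\,\mathbb{E}\big[\ln M^{\mathcal{W}}_{c,1}(q_n\vartheta_0^n(a_l, z_f))\big]\Big). \tag{3.36}$$

Letting now $n$ tend to infinity we obtain

$$C_1\left(\mathbb{E}\big[M^{\mathcal{W},a_k}_{c,1}M^{\mathcal{W},a_l}_{c,1}\big] - \mathbb{E}\big[M^{\mathcal{W},a_k}_{c,1}\big]\mathbb{E}\big[M^{\mathcal{W},a_l}_{c,1}\big]\right). \tag{3.37}$$

Limit and integral are interchangeable because dominated convergence is applicable due to the
boundedness of the logarithmic moment generating functions for $\vartheta \in (0, \vartheta^{**})$. □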

To complete the proof of Lemma 4 we need to prove tightness. To do so, we use, as usual, the Kolmogorov-Chentsov
criterion [9] and check Conditions 2 and 3.29.

Proof (Proof of Lemma 4) The family of initial distributions is given by the random variables
evaluated in $\vartheta_0^n(\varepsilon, z_f)$ for an $\varepsilon > 0$ because $a > n^{-1}\mathbb{E}^{\mathcal{W}}[G_n(z_f)] > 0$. This family is seen to be tight
using Chebychev's inequality:
— J2 (lnAf5(<7„i?S(£, ^/)) - E [lnM5(g„,?S(£, Zf))]) > K 



i=i 



< 



i E;:i V[lnM5(g„z?^(£, Zf))] n, V[lnMjf,(g„t9^(£, Zf))] 



R-^ 



i^2 



(3.38) 



With ^ -^ Ci and qn -^ 1 exist 6, S and hq such that ^ < Ci + 5 and qn-do^s, Zf) < i^q (e, Zf) + S 
for all n > riQ. Thus, 



^V[lnA4'^^.(g„i?o"(£,z/))] < iC,+5MlnM:,\{^-{s,Zf) + S))] 
for all n > riQ and we obtain 

P I ^ f;(ln A.f5(<z„,9^(e, z^)) - E[lnAf5(g„i?o"(£, Zf))]) > K 

"^«-V [i;^] , (Ci + S)Y [inM^i^Ue, Zf) + 5)] 



(3.39) 



fCX max s max 

iGl,...,nn — 1 



(3.40) 



For each $\varepsilon$ we can choose $K$ large enough such that 3.40 $< \varepsilon$ and thus we have proven tightness of
the initial distributions.

It remains to check Condition 3.29. We have



E 



da 



Zn,c{qn'&o{a,Zf)) 



= E 



1 "<: /I 



E 



j=i 









-E 



=0 



[/ r/ 


\ 


2" 




yn 




\da 


0.,'J 
/ 





(3.41) 



because $Y^n_{a,j}$ and thus $\frac{d}{da}Y^n_{a,j}$ are centered i.i.d. random variables. They are independent for different
$j$ because $\vartheta_0^n(a, z_f)$ depends only on the random variable $W_f$ by definition. Thus, it is enough to
show that $\mathbb{E}[(\frac{d}{da}Y^n_{a,j})^2]$ is bounded. Since $\mathbb{E}[X]$ and thus $X - \mathbb{E}[X]$ are bounded if the random
variable $X$ is bounded, it suffices in our scenario to show boundedness of the uncentered random
variable. Let $X^n_{a,j} = \vartheta_0^n(a, z_f)Z_j^cW_j$. Then



0<^ (InE^ 
da 



=9"^r 



<g„max ( -r^a,] 



E' 



n 






EK [e?-^?,] 



E^ 


'e^^^lo 


E^ 


[e'^"^?..] 



< g„Zi^^'"^'^T4^r"max - (7?JJ(a, Zf)) . 
a da 



We know by an application of the implicit function theorem that 

d 



da 



d^{a,Zf)^{g:{q^i9^{a,Zf))) \ 



(3.42) 

(3.43) 

D 



Thus, we get tightness if $g_n''(q_n\vartheta_0^n(a, z_f)) > C > 0$.
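
The role of the lower bound on $g_n''$ can be seen from implicit differentiation. As a sketch, assume that $\vartheta_0^n(a, z_f)$ is defined through Equation 3.57, that $g_n$ is twice continuously differentiable, and that $q_n$ and $W_f$ do not depend on $a$:

```latex
\begin{align*}
a - \frac{z_f}{n} W_f &= q_n\, g_n'\big(q_n \vartheta_0^n(a,z_f)\big)
  && \text{(Equation 3.57)} \\
1 &= q_n^2\, g_n''\big(q_n \vartheta_0^n(a,z_f)\big)\,
     \frac{\partial}{\partial a}\vartheta_0^n(a,z_f)
  && \text{(differentiate in } a\text{)} \\
\frac{\partial}{\partial a}\vartheta_0^n(a,z_f)
  &= \frac{1}{q_n^2\, g_n''\big(q_n \vartheta_0^n(a,z_f)\big)}
  \le \frac{1}{q_n^2\, C}
  && \text{(if } g_n'' > C > 0\text{)}
\end{align*}
```

Since $q_n \to 1$, a uniform lower bound on $g_n''$ bounds this derivative uniformly in $a$ and $n$.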

With analogous calculations we get the convergence of the derivatives of $Z_n(q_n\vartheta_0^n(a, z_f))$.

Lemma 6 The processes $Z_n'(q_n\vartheta_0^n(a, z_f))$ and $Z_n''(q_n\vartheta_0^n(a, z_f))$ as processes on the Wiener space with
parameter $a$ converge weakly if there exists $C > 0$ such that $g_n''(q_n\vartheta_0^n(a, z_f)) > C > 0$.

Proof In analogy to the previous proof we define


K^y 



E 



n 



EK [e«"^"..] 



and 



ix:.)" - 



E' 



n 



(g„Z,-W^,)'e«"^"- 



E 



'e'^ 



E 



n 



EK [e9"^a„] 



(3.44) 



(Z„ZW,e«"^" 



EK [e«"^.".] 



EK [e^"^":-] 



-E 



E 



n 



(g„Z,=W^,)'e«"''"- 



E' 



K 



(7„^,^VF,e«"^»". 



^i"J 



EK [e«"^»".] I E^ [e«'-^">:'] 

The constitutive parts of the processes under consideration are given by 



(3.45) 



(3.46) 



j=i 



j=i 



With the notation (xj)' = ^{{Y^,^)', . . . ,{Y^^^^)') and (x,")" = -^^((r,"^,,-)", • • • , (>;ij)") 



•/" 



obtain |(x")'P < ^ [Z''{'^''''WY'''''f and |(x")'T < :i^(Zi '""'''' VFi'"'^'^)^. Hence, the convergence of 
the finite dimensional distributions of Z'^ ^ and Z'^ ^ follows with the same argument as before. 

Concerning the tightness of the initial distributions we get the two following bounds using again 
Chebychev's inequality: 






<K max < max 

igl,...,no 



E^[ZfiyieW(e.^/)+5)ZiWi 



(3.47) 



and 



E(^. 



(Ci + 5)W 



e.j)" >K I </f-^max 



max 



■nc{i) 



iel,...,no-l Z 



v[(r;,,)"]> 



E'^[(Z5=iyi)2eW(^'^/)+^)^i^i] /E'^[ZfVFieW(S'^/)+^)^i^i] 



EK[e('5o(e:Z/)+J)^?^il 



EK[g(i?S(e,^/)+<5)Z-Wi 



(3.48) 
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Using again Condition 3.29 it is enough to bound 



da E^[e'"^».^ 
and 




da 1 E^[e«"^".j] ^ E^[e'?"^-^] 

These derivatives are given by 



(3.49) 



(3.50) 



and 



(3.51) 



E^[(q„ZW,)2(^[?)'(a,z/)e«"^-]E^[q„Z,-l^,e«"^-.] 



(E^[: 



?„X'' 



^ E^[(^g)'(a, z;)g„Z|W,e^"^-".](E^[g„Z|Ty,-e^"^"..])2 
{E^[e'"^"-^])3 

We are able to bound the first derivative according to 



- max(z9^)'(a, z/)(Z,;''"""Tyj""'')" < 3.51 < max(t?JJ)'(a, z/)(Z;''"""W7"'^)^ (3.53) 



Thus, we get the same criterion as in Lemma |4] for boundedness and thus tightness here. We 
can bound each summand in |3.52| very similar to the previous cases and end up with the bound 
Amaxai^QYia, Zf){Z'i'™'^^W"^'^^)^ . Thus, we again arrive at the same criterion to get tightness. D 

Lemma 7 $X_a^n = (Z_n(q_n\vartheta_0^n(a, z_f)), Z_n'(q_n\vartheta_0^n(a, z_f)), Z_n''(q_n\vartheta_0^n(a, z_f)))$ converges weakly if there exists
$C > 0$ such that $g_n''(q_n\vartheta_0^n(a, z_f)) > C > 0$.

Proof The structure of the proof is the same. First, we consider the finite dimensional distributions.
Let $Y^n_{a,j}$, $(Y^n_{a,j})'$ and $(Y^n_{a,j})''$ be defined as above. We now investigate

$$\chi_j^n = \frac{1}{\sqrt{n}}\big(Y^n_{a_1,j}, (Y^n_{a_1,j})', (Y^n_{a_1,j})'', \dots, Y^n_{a_m,j}, (Y^n_{a_m,j})', (Y^n_{a_m,j})''\big). \tag{3.54}$$

These vectors are again independent for different $j$. The boundedness of $|\chi_j^n|^2$ follows directly from
the boundedness in the previous cases. Again, $|\chi_j^n|$ tends to $0$ such that the Lindeberg condition
is satisfied. Part 1 of Condition 3.30 holds because we consider centered random variables. Part 2
holds because we can bound $|\chi_j^n|^2$. It can be shown that the initial distributions are tight using
again Chebychev's inequality. To prove tightness we have to show that

$$\mathbb{E}\big[|X^n_{a+h} - X^n_a|^2\big] = \mathbb{E}\Big[\big(Z_n(q_n\vartheta_0^n(a+h, z_f)) - Z_n(q_n\vartheta_0^n(a, z_f))\big)^2 + \big(Z_n'(q_n\vartheta_0^n(a+h, z_f)) - Z_n'(q_n\vartheta_0^n(a, z_f))\big)^2 + \big(Z_n''(q_n\vartheta_0^n(a+h, z_f)) - Z_n''(q_n\vartheta_0^n(a, z_f))\big)^2\Big] \le C|h|^2. \tag{3.55}$$

This holds true because we have already seen that each summand can be bounded by the right-hand
side for a certain $C$. Thus, we just have to sum up the different constants. □
Finally, we come to the proof of the result about the rate function. 

Proof (Proof of Theorem 7) We look at the equation determining $\vartheta_n(a, z_f)$,

$$a - \frac{z_f}{n}W_f = q_ng_n'(q_n\vartheta) + \frac{q_n}{\sqrt{n}}Z_n'(q_n\vartheta), \tag{3.56}$$

and write the solution of this equation in the form $\vartheta_0^n(a, z_f) + \delta^n(a, z_f)$, where $\vartheta_0^n(a, z_f)$ is defined
as the solution of

$$a - \frac{z_f}{n}W_f = q_ng_n'(q_n\vartheta). \tag{3.57}$$

$\delta^n(a, z_f)$ denotes the stochastic perturbation of this equation caused by the process $Z_n$. By definition
of $\vartheta_0^n(a, z_f)$, we have $q_ng_n'(q_n\vartheta_0^n(a, z_f)) = a - \frac{z_f}{n}W_f$. We can derive an expression for $\delta^n(a, z_f)$
using a first order Taylor expansion. To keep the notation short we drop the arguments and write
just $\vartheta_0^n$ and $\delta^n$. We obtain



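$$a - \frac{z_f}{n}W_f = q_ng_n'(q_n\vartheta_0^n) + q_n^2\delta^ng_n''(q_n\vartheta_0^n) + \frac{q_n}{\sqrt{n}}Z_n'(q_n\vartheta_0^n) + \frac{q_n^2\delta^n}{\sqrt{n}}Z_n''(q_n\vartheta_0^n) + O((\delta^n)^2)$$

$$\Leftrightarrow\quad \delta^n = -\frac{\frac{1}{\sqrt{n}}Z_n'(q_n\vartheta_0^n) + o(\delta^n)}{q_n\left(g_n''(q_n\vartheta_0^n) + \frac{1}{\sqrt{n}}Z_n''(q_n\vartheta_0^n)\right)} = -\frac{\frac{1}{\sqrt{n}}Z_n'(q_n\vartheta_0^n)}{q_n\left(g_n''(q_n\vartheta_0^n) + \frac{1}{\sqrt{n}}Z_n''(q_n\vartheta_0^n)\right)} + o(n^{-1/2}). \tag{3.58}$$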



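The rate function can be rewritten as

$$I_n^{\mathcal{W}}(a, z_f) = a\vartheta_n(a, z_f) - \Psi_n^{\mathcal{W}}(\vartheta_n(a, z_f)) = \left(a - \frac{z_f}{n}W_f\right)(\vartheta_0^n + \delta^n) - g_n(q_n(\vartheta_0^n + \delta^n)) - \frac{1}{\sqrt{n}}Z_n(q_n(\vartheta_0^n + \delta^n)). \tag{3.59}$$

A second order Taylor expansion and reordering of the involved terms yields

$$I_n^{\mathcal{W}}(a, z_f) = a\vartheta_0^n - g_n(q_n\vartheta_0^n) - \frac{z_f}{n}W_f\vartheta_0^n - \frac{1}{\sqrt{n}}Z_n(q_n\vartheta_0^n) + \underbrace{\left(\left(a - \frac{z_f}{n}W_f\right) - q_ng_n'(q_n\vartheta_0^n)\right)}_{=0}\delta^n - \frac{1}{\sqrt{n}}q_n\delta^nZ_n'(q_n\vartheta_0^n) - \frac{1}{2}q_n^2(\delta^n)^2\left(g_n''(q_n\vartheta_0^n) + \frac{1}{\sqrt{n}}Z_n''(q_n\vartheta_0^n)\right) + o((q_n\delta^n)^2). \tag{3.60}$$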


The stochastic process $Z_n(q_n\vartheta_0^n(a, z_f))$ converges weakly to the mentioned Gaussian process according
to Lemma 4. $g_n''$ is of the order $O(1)$ according to



g'^iqnd) ^ CiE 

+ C2E 



(E^[e'»^f^i])2 
(E^[e''^i^i])2 



(3.61) 



Together with the joint weak convergence of the processes $Z_n'(q_n\vartheta_0^n(a, z_f))$ and $Z_n''(q_n\vartheta_0^n(a, z_f))$ this
yields $\delta^n \in O(1/\sqrt{n})$. Furthermore, this implies $f \in o(1/\sqrt{n})$ for each $f \in o(\delta^n)$ and $f \in o(1/n)$ for
each $f \in o((\delta^n)^2)$. We plug the expression for $\delta^n$ into Equation 3.60 and obtain



Cia, zf) = /„"(a, zf) ~ '^Wf^^ - 4^Z„(q„^^) 



rnl -ij^Zni<ln^0)) , .,„.V /....,„. , 1 



2 Un(5^'(gn^^) + ij^;'(g„^[f)) ' V V^"' "' V^ 






9?. {g'^i^uK) + 7^^;'(9n^S)) 9„ (<'(g„i?S) + ^.z'^i^inK) 



+ o{{5-Y) 



"^ "^'^"^°^^ +^Z;(g„z?^)o(,5")+o((<5")2). (3.62) 



According to the observations concerning 5" this equals 

where ^77 — L? "1 °J,,/ — sttt converges weakly due to continuous mapping and the joint weak con- 
vergence of Z,'j(g„i?g) and Z"(g„'i9o). This completes the proof of the theorem. D 
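
The second order expansion used in this proof can be checked numerically on a toy example. The sketch below is only an illustration under simplifying assumptions that are not part of the model: $q_n = 1$, $z_f = 0$, $g(\vartheta) = \ln\cosh\vartheta$ plays the role of $g_n$, the fixed function $\sin\vartheta$ stands in for one realization of $Z_n$, and `eps` plays the role of $1/\sqrt{n}$. It compares $a\vartheta - g(\vartheta) - \varepsilon Z(\vartheta)$ at the exact stationary point with the expansion $I_0 - \varepsilon Z(\vartheta_0) + \varepsilon^2 Z'(\vartheta_0)^2 / \big(2(g''(\vartheta_0) + \varepsilon Z''(\vartheta_0))\big)$.

```python
import math


def exact_rate(a, eps):
    """a*th - g(th) - eps*Z(th) at the solution of g'(th) + eps*Z'(th) = a."""
    lo, hi = 0.0, 2.0
    for _ in range(200):                       # bisection on the stationarity equation
        mid = 0.5 * (lo + hi)
        if math.tanh(mid) + eps * math.cos(mid) < a:
            lo = mid
        else:
            hi = mid
    th = 0.5 * (lo + hi)
    return a * th - math.log(math.cosh(th)) - eps * math.sin(th)


def expanded_rate(a, eps, with_remainder=True):
    """Expansion around th0 solving g'(th0) = a, optionally with the eps^2 term."""
    th0 = math.atanh(a)
    g2 = 1.0 - a * a                           # g''(th0) = 1 - tanh(th0)^2
    value = a * th0 - math.log(math.cosh(th0)) - eps * math.sin(th0)
    if with_remainder:
        value += (eps * math.cos(th0)) ** 2 / (2.0 * (g2 - eps * math.sin(th0)))
    return value


for eps in (0.1, 0.01):
    exact = exact_rate(0.5, eps)
    print(eps, exact - expanded_rate(0.5, eps, False), exact - expanded_rate(0.5, eps))
```

Without the quadratic term the error is of order $\varepsilon^2$; with it the error drops to order $\varepsilon^3$, which corresponds to a remainder of order $1/n$ once $\varepsilon = 1/\sqrt{n}$ (only small values of `eps` are meaningful here, since the toy function is not convex for large ones).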

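Case 3.) Conditioning on $\mathcal{Z}$ again produces i.i.d. random variables because the $Z_j$ are measurable w.r.t. $\mathcal{Z}$ and
the stimulation rates are independent of this $\sigma$-algebra.

Theorem 9 Let $(a_n)_{n\in\mathbb{N}}$ be defined by $a_n = a$ and $g_{act}(n) = an$ such that $g_{act}(n) > \mathbb{E}^{\mathcal{Z}}[G_n(z_f)]$ and
$a < \sup_{\vartheta\in\mathbb{R}} \frac{d}{d\vartheta}\Psi_n^{\mathcal{Z}}(\vartheta)$ for all $n \in \mathbb{N}$. Then Theorem 4 is almost surely applicable provided $z_f/n \downarrow 0$, the
distribution functions of the stimulation rates are neither lattice valued nor concentrated on one point and
the moment generating functions $M^{\mathcal{Z}}_{c,j}(\vartheta)$, $M^{\mathcal{Z}}_{v,j}(\vartheta)$ and $M^{\mathcal{Z}}(\vartheta)$ as well as $M_c(\vartheta)$, $M_v(\vartheta)$ and $M(\vartheta)$ are
finite for each $\vartheta \in \mathbb{R}$. The rate function is

$$I_n^{\mathcal{Z}}(a, z_f) = a\vartheta_n(a, z_f) - \frac{1}{n}\left(\sum_{j=1}^{n_c}\ln M^{\mathcal{Z}}_{c,j}(q_n\vartheta_n(a, z_f)) + \sum_{j=n_c+1}^{n_c+n_v}\ln M^{\mathcal{Z}}_{v,j}(q_n\vartheta_n(a, z_f)) + \ln M(z_f\vartheta_n(a, z_f))\right). \tag{3.64}$$

Proof The proof of this theorem goes along the same lines as the analogous result in Case 2.) and
will be skipped. □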

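Investigation of the rate function. In this case, the properties of the large deviation rate function can again be
described by a functional central limit theorem. Using the notation from Section 2 we obtain the following
result.

Theorem 10 If there exists a constant $C$ such that $g_n''(q_n\vartheta^n(a, z_f)) > C > 0$, the rate function takes the
form

$$I_n^{\mathcal{Z}}(a, z_f) = I^n(a, z_f) - \frac{1}{\sqrt{n}}Z_n(q_n\vartheta^n(a, z_f)) - \frac{1}{n}\ln M(z_f\vartheta^n(a, z_f)) + R_n, \tag{3.65}$$

where $R_n \in O(\frac{1}{n})$. $Z_n(q_n\vartheta^n(a, z_f))$ converges weakly to the Gaussian process $Z_a + \tilde{Z}_a$. $Z_a$ and $\tilde{Z}_a$ are
both Gaussian processes with expectation functions $\mathbb{E}[Z_a] = 0 = \mathbb{E}[\tilde{Z}_a]$ and covariance functions

$$\mathrm{Cov}(Z_a, Z_{a'}) = C_1\Big(\mathbb{E}\big[\ln(\mathbb{E}^{\mathcal{Z}}[e^{\vartheta(a)Z_1^cW_1}])\ln(\mathbb{E}^{\mathcal{Z}}[e^{\vartheta(a')Z_1^cW_1}])\big] - \mathbb{E}\big[\ln(\mathbb{E}^{\mathcal{Z}}[e^{\vartheta(a)Z_1^cW_1}])\big]\mathbb{E}\big[\ln(\mathbb{E}^{\mathcal{Z}}[e^{\vartheta(a')Z_1^cW_1}])\big]\Big) \tag{3.66}$$

and

$$\mathrm{Cov}(\tilde{Z}_a, \tilde{Z}_{a'}) = C_2\Big(\mathbb{E}\big[\ln(\mathbb{E}^{\mathcal{Z}}[e^{\vartheta(a)Z_1^vW_1}])\ln(\mathbb{E}^{\mathcal{Z}}[e^{\vartheta(a')Z_1^vW_1}])\big] - \mathbb{E}\big[\ln(\mathbb{E}^{\mathcal{Z}}[e^{\vartheta(a)Z_1^vW_1}])\big]\mathbb{E}\big[\ln(\mathbb{E}^{\mathcal{Z}}[e^{\vartheta(a')Z_1^vW_1}])\big]\Big). \tag{3.67}$$

The approach to prove this result is the same as in Case 2.) and we will not present the details.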

4 Conclusion and Outlook 

The main new aspects of the present work are the investigation of the conditional scenarios and the estab- 
lishment of a higher robustness of the model using classes of distributions instead of concrete distributions. 

The first point allows a more precise understanding and interpretation of the activation mechanism. The
presented results show that the parameter $a$ can be chosen such that the activation probabilities increase
exponentially with $z_f$ for the regime $\sqrt{n} \ll z_f \ll n$ in Cases 1.) and 3.). For Case 2.) this result depends
on the actual value of the stimulation rate of the foreign peptide, $W_f$. This value has to be large enough in
order that the probability of activation increases exponentially with $z_f$. In biological terms this means that
the overall frequency of T-Cell activation increases as desired. This growth is caused by those T-Cell types
which interact strongly with the given foreign peptide type. These findings are on the one hand very suitably 
captured by the title of recent work of van den Berg et al. |2]J "Specific T-cell activation in an unspecific T- 
cell repertoire" and on the other hand justify this formulation. This interpretation also suits to the histograms 
of stimulation rates and the explanation of these histograms in ifTTl . 

The generalized distribution assumptions allow the choice of different distributions depending on the 
mechanisms that should be included into the model. For example MHC-loading fluctuations, influence of 
different affinity of the different peptide types to the same receptor and maybe co-stimulation could be 
considered. For the relevance of co-stimulation in the immune response see, e.g. the recent review by Chen 

m. 

One aim of future work is to investigate the mechanism of negative selection. Thereby T-Cells which 
interact too strongly with the body's own structures are deleted. Van den Berg et al. investigated this mech- 
anism in a different model setting in ifTSl and also Zint et al. Il23l included negative selection into numerical 
simulations. In this case the situation becomes mathematically more involved, since the selection causes 
dependencies in between the stimulation rates as was already pointed out in 1.15.1 . 
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